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Abstract 

Interference is a fundamental feature of the wireless channel. Cooperation among 
the radios has been shown to manage interference in a multiple unicast wireless network 
in a near optimal scaling law sense In this paper, we study the two user Gaussian 
interference channel where the source nodes can both talk and listen. Focusing on the 
full-duplex mode of operation, we characterize the sum capacity up to 18 bits. Novel 
inner and outer bounds are derived in arriving at our main result. 

1 Introduction 

The standard engineering approach to dealing with interference in wireless systems is to 
orthogonalize the transmissions and/or treat interference as noise at the receivers. However, 
these strategies can be far from optimal in several canonical scenarios, including the clas- 
sical two user Gaussian interference channel [U |2]. Superposition coding and interference 
alignment has been shown to perform well in interference channels (where the sources only 
transmit and destinations only receive) [21 [3]. 

A different approach to interference management is available when the wireless nodes 
can cooperate among themselves (this situation is not feasible in the classical interference 
channel when sources only transmit and destinations only receive). A coarse result (scaling 
of symmetric capacity as the number of radios grows) derived in [3] shows that distributed 
cooperation (a so-called hierarchical MIMO strategy) can manage interference between the 
different traffic flows so well as to get near the performance of (co-located) MIMO operation 
among the nodes. While this is a strong result, it is coarse - in the asymptotic regime of a 
very large number of radios. 

Our goal in this paper is to better understand the role of cooperation in providing in- 
terference management gains. We take a fundamental (information theoretic) approach by 
studying the capacity region of a simple (yet, canonical) wireless network: we start with the 
classical two user Gaussian interference channel, but endow the sources with the capability 



to listen as well (apart from the usual capability of transmission). The cooperative links are 
over the same frequency band as the rest of the links. We treat only the sum-rate under 
a full-duplex mode of operation in this paper. The main result is a characterization of the 
sum-rate within 18 bits. 

Our main focus while deriving our main result is on constructing novel achievable schemes 
as well as new converse techniques. Our achievable scheme is generic, in the sense that it 
can be described in the context of a general discrete memoryless interference channel with 
source cooperation. This general nature of the scheme (as well as the need to calculate a 
uniform gap between our inner and outer bounds) results in a large gap (presently 18 bits). 
In specific instances our characterization is readily sharpened - we provide an example where 
the gap reduces to 6 bits, for instance. We also provide a few general recipes to improve the 
inner and outer bounds (though calculating the improvement explicitly is rather involved). 

The superposition scheme of Han and Kobayashi [3] for the two user interference channel 
involves the two destinations partially decoding the interference they receive. In order to 
facilitate this, the sources encode their messages as a superposition of two partial messages. 
One of these partial messages, termed the public message, is decoded by the destination 
where it appears as interference along with the two partial messages which are meant for 
this destination. The other partial message, called the private message, from the interfering 
source is treated as noise. Our achievable scheme preserves these two types of messages and 
employs a further two types of messages: 

• a cooperative-public message which is decoded not only by both the destinations, but 
also by the other source which aids its own destination in decoding it, and 

• a cooperative-private message which is decoded by the destination to which it is intended 
and by the other source which cooperates with the original source in its transmission. 

These two messages have similarities to the suggestions of Tuninetti [6j , Cao and Chen [7] , 
and Yang and Tuninetti jH], but differ in the details of implementation which have a bearing 
on the rates achieved. 

Other related works include [9] which studies the two-user interference channel under co- 
operation, [10] which investigates the symmetric two-user interference channel under noiseless 
feedback, and [UJ [12] which study a two-stage, two-source interference network. 

2 Problem Statement 

We consider the following channel model for source cooperation (see Figure [I]). At each 
discrete-time instance - indexed by t = 1, 2, ... - the source nodes 1 and 2 send out, respec- 
tively, Xi(t) and X 2 {t) G C. The source nodes 1 and 2, and the destination nodes 3 and 4 
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Figure 1: Problem Setup 



receive respectively 

Y 1 (t) = h 2 ,iX 2 (t) + N 1 (t), 

Y 2 {t) = h 1 , 2 X 1 {t) + N 2 {t), 

Y 3 (t) = h^X^t) + h 2 , 3 X 2 (t) + N 3 (t), 

Y 4 (t) = h 2A X 2 (t) + h^X^t) + N A {t), 

where the channel coefficients /i's are complex numbers and N k (t), k — 1, 2, 3, 4, t = 1,2,... 
are independent and identically distributed (i.i.d.) zero-mean Gaussian random variables 
with unit variance. It is easy to see that, without loss of generality, we may consider a 
channel where the channel coefficients ^1,3,^1,2,^2,4,^2,1 are replaced by their magnitudes 
I ^1,3 1 5 l^i,2|, I ^2,4 1, 1 h 2 ,i I, and the channel coefficient hi^ is replaced by \h\^\e^ e ^ 2 and h 2>3 is 
replaced by \h 2j3 \e je ^ 2 , where 9 = f arg(/ii, 4 ) +arg(/i 2 ,3) — arg(/ii, 3 ) — arg(/i 2 ,4). We will consider 
this channel. We will also assume that \hi t2 \ = \h 2t i\ = he, say which models the reciprocity 
of the link between nodes 1 and 2. Further, we consider unit power constraints without loss 
of generality. 

There is a causality restriction on what the sources are allowed to transmit: it can only 
depend on the message it sends and everything it has heard up to the previous time instance, 
i.e., 

X k (t) = f k , t (M k ,Yt 1 ), fc = l,2, 
where M k is the message to be conveyed by source k and / is a (deterministic) encoding 
function. A blocklength-T codebook of rate (Ri,R 2 ) is (for each k — 1,2) a sequence of 
encoding functions, f k ,t, t — 1,2, ... , T such that 



E 



T 



T 



*=i 



< 1, 



with message alphabets M. k = {1,2, .. . ,2 TRk } over which the messages M k are uniformly 
distributed, and decoding functions g k+2 : C T — > .Mfc. We say that a rate (Ri,R 2 ) is 
achievable if there is sequence of rate (R±, R 2 ) codebooks such that as T — > 00, 



P (^+2(^+2) ^ M fc ) - 0, A; = 1,2. 
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We would also like to consider a linear deterministic model (introduced in [13J) for the 
above channel. In order to treat both models together, we will adopt the following: 

Y 1 (t) = hl 1 (X 2 (t)), 
Y 2 (t) = hl 2 (X 1 (t)), 
Y 3 (t)=h lt3 (X 1 (t)) + hZ f3 (X 2 (t)), 
Y A {t) = h 2A {X 2 {t)) + h\ A {X l {t)). 

Here the functions with a * are potentially random functions and the others are deterministic 
functions. 

Gaussian case: In the Gaussian case, we specialize to (with some abuse of notation^): 

hl tl (X 2 ) = h 2 ,iX 2 + N h 
hl 2 (X 1 ) = h h2 X 1 + N 2j 

^2,4(^2) = ^2,4-^2? 

h; >3 (X 2 ) = h 2 , 3 X 2 + N 3 , 
h* lA {X x ) = h lA X x + iV 4 . 

We will further assume that hi )2 = h 2i i which is justified by the reciprocity of the links 
between the sources. 

Linear deterministic case: Let rii j2 , n\ t3 , n\ A ,n 2> i, n 2 ^,n 2A be non-negative integers and 
n = f max(ra lj2 , ra 13 , n 2 ,i, ra 2 ,3, ra 2 4 ). The inputs to the channel X x and X 2 are n-length 
vectors over a finite field F. We define 



where 
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1 The correct notation would be Y\(t) = h* 2 \ (X2(t)) = /i2.i^2(i) + Ni(t). This t-index in the notation 

for random functions like h* 2 ^ will be suppressed. We will tacitly assume that application of *-ed functions 
for different values of t result in independent realizations of N's. 
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is the nx n shift matrix. Further, to model the reciprocity of the links between the sources, 
we set n 1)2 = n 2 ,i = n c , say. 



3 Main Results 

3.1 Sum-rate Characterization 

The following theorems characterize the sum-rates of the interference channels with source 
cooperation introduced in the previous section. 

Theorem 1 Linear deterministic case. The sum- capacity of the linear deterministic channel 
with source cooperation is the minimum of the following: 

ui = max(ni i3 - n M + n c , n 2)3 , nc) + max(n 2 , 4 - n 2 , 3 + n c , n 1A , n c ), (1) 

u 2 = max(n 1)3 , n 2j3 ) + (max(n 2j4 , n 2j3 , n c ) - n 2 , 3 ) , (2) 

u 3 = max(n 2i4 , 711,4) + (max(ni j3 , ^1,4, nc) - 0,1,4) , (3) 

w 4 = max(ni i3 ,nc) + max(n 2i4 , nc), and (4) 

_ f max(ni ;3 + n 2i4 ,n li4 + n 2i3 ), !/n 1)3 - n 2j3 7^ n M - n 2 , 4 , 
I max(ni j 3, n 2 ,4, 0,1,4, 712,3), otherwise. 

The achievability of the above theorem in proved in Appendix |Bl and the upperbounds are 



derived in Appendix D 



Theorem 2 Gaussian case. The sum- capacity of the Gaussian channel with source cooper- 
ation is at most the minimum of the following five quantities and a sum-rate can be achieved 
to within a constant (18 bits) of this minimum. 



2^ 



+ 1 -U + (n^^ l<l + l*A (7) 



u 2 = log2 1 + (\h h3 \ + \h 2 , 3 \Y) 1 + " ' ' ' r 2 - ' , (8) 

v ' \ max(l, \h 2 - 3 \ 2 ) J 

/ ,,, I max (\hi 3 | 2 , \h\ 4 | 2 , \hc\ 2 ) \ , s 

u 3 = log2 (l + ( h 2 , 4 + h 1A ) 2 ) 1 + n 11, 12 > 9 

v ' \ max (1, |/h,4n) / 

w 4 = log (l + |/ii i3 | 2 + \h c f) + log (l + \h 2A \ 2 + \h c \ 2 ) , and (10) 
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log ^1 + 2 (|/i li3 | 2 + |/i 2 , 4 | 2 + |M 2 + IM") 



+ 4 (\h 1>3 h 2i i\ +\h 1A h 2t3 \ - 2 1^1,3/1^4/11,4^2,3 1 cos 6>J J . (11) 
The achievability proof is presented in Appendix O and the converse in Appendix [D 
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C D = \og{l + h 2 D ) 



loglH 2 

log'IM 2 



Figure 2: Normalized sum-capacity of the symmetric interference channel with hj = 
y/ho under source cooperation in the limit of h D — > oo keeping log \hi\ 2 / log \h D \ 2 and 
log \h c \ 2 / log \h,E>\ 2 fixed. 
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3.2 Gains from cooperation 

To illustrate the gains from cooperation, in this section we will explore a specific instance 
of the symmetric interference channel: = 1^-2,4 1 — hp, 

9 = 0. In appendix [F] we prove the following proposition. 

Proposition 3 Let C be the minimum of the following 

21og2/i D + 



log 2h 2 D \l + 
log (4h%) . 



ma,x(h 2 D , h 



c) 



h 



D 



For the symmetric channel described above, for any e > 0, the sum-capacity lies in (C — 6 — 
e, C + e), for sufficiently large ho. 



We plot in Fig. |2j as a function of log \ hc\ 2 / log |/io| 2 , the sum-rate C normalized by the ca- 
pacity of the direct link, in the limit of \ho\ — > oo while keeping the ratios log \ hc\ 2 / log \ hr,\ 2 
and log \hi\ 2 / log \h D \ 2 constants. Since C is achievable within a constant gap (6 bits), this 
plot is also that of the sum-capacity in this limit. This reveals three regimes of operation: 

• log \h c \ 2 / log \h D \ 2 < 1/2. In this regime, the plot shows that the capacity increases 
linearly with the strength of the cooperation link (measured in the dB scale). For 
every 3dB increase in link strength the sum-capacity increases by 2 bits. Note that 
without cooperation, the sum-capacity is essentially achieved by time-sharing. Thus, 
cooperation can be seen to deliver significant gains. 

• 1/2 < log \hc\ 2 / log \ho\ 2 < 1. The linear gain saturates when the cooperation link 
strength is half the direct link strength. No further gains are accrued until the coop- 
eration link is as strong as the direct link. 
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• 1 < log \ hc\ 2 / log \ ho\ 2 < 3/2. The capacity again increases linearly with the coopera- 
tion link strength, but here an increase in capacity by 2 bits requires a 6dB increase in 
the cooperation channel strength. This linear increase continues until the cooperation 
capacity is approached when the cooperation link is 3/2 times as strong as the direct 
link, after which the capacity is flat. 

4 Coding Scheme: Illustrative Examples 

In this section, our cooperative coding scheme is illustrated through a few simple examples. 
These examples have been hand-picked so that uncoded (signal processing) strategies are 
enough to achieve the sum-capacity. However, they cover the key novel aspects of our coding 
strategy. In particular, the first example shows how the sources cooperate in conveying 
cooperative-public messages, which are messages decoded by both sources and which benefit 
from cooperation. Example 1 also involves the use of private messages which are decoded 
only by the destinations to which they are intended and get conveyed without the benefits 
of cooperation. Example 2 shows how cooperative-public messages can occur together with 
the two types of messages in Han-Kobayashi's scheme, namely, private messages and public 
messages (public messages are decoded by both the destinations and do not benefit from 
cooperation). Example 3 illustrates a second kind of cooperative message: cooperative-private 
message which benefits from cooperation and is decoded only by the destination which is 
interested in it. 

Example 1: For the linear deterministic case, let us consider the symmetric channel with 
direct links ni j3 = n 2) 4 = say, and interference links n li4 = n 2j 3 = rij, say, such that 
7i£> = 2nj. When source cooperation is absent, i.e., nc = 0, the sum capacity turns out to 
be rip and it can be achieved simply by time-sharing. Now, let us consider nc = Ji.d/4, and 
in particular, no — 4:,ni — 2,ric = 1. The sources transmit 

1) \ 
/ 



1) / 

which means that before time t, source 1 knows v 2 (t — 1) and source 2 knows vi(t — 1). 
The transmissions are over a long block of length T with the signals at time T such that 
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xi(t) = 



f v 1 (t)+v 1 (t-l) \ 
v 2 (t-l) 

Zla(t) 
Zlb(t) 



Note that this is possible because 

/ 

yi(t) = 



o 
o 
o 



and x 2 (t) = 



J 



and j/2 (i) 



V v 2 (t) + v 2 {t - 1) J 



( v 2 (t)+v 2 (t 
vi(t-l) 
z 2a {t) 
\ z 2b {t) 







V v 1 (t)+v 1 (t- 



ZkaiT) = Zkb(T) = 0, k = 1,2. Also, we interpret ffc(0) = 0. Then, destination 3 receives 

/ 

T — 1, and 



ys(T) 



v 1 (t)+v 1 (t-l) \ 
v 2 (t-l) 

Zla(t) +V 2 (t) +V 2 (t- 1) 

V 2l6(t) + Vi{t - 1) / 

/ « 1 (T)+« 1 (T-1) \ 
^(T - 1) 
«2(T) + u 2 (T - 1) 

V «i(T-l) / 



* = 1,2, 



At the end of time T, destination 3 starts reading off the signals backwards starting from what 
it received at time T. From y^{T) it can recover vi(T),v 2 (T),vi(T — l),v 2 (T — 1). Making 
use of the latter two, i.e., V\(T — l),v 2 (T — 1), it can recover z\ a {T — 1), Zu(T — i),v\(T — 
2),f 2 (T — 2) fromy 3 (T — 1). Then, employing its knowledge of v\(T — 2), v 2 (T — 2), it recovers 
Z\ a (T — 2), ZxbiT — 2), V\{T — 3),v 2 (T — 3) from yz(T — 2), and so on. Thus, destination 3 can 
recover {(v\(t), zi a (t), zib(t), v 2 (t)) : t = 1,2, ... ,T}. By symmetry, destination 4 can also 
recover its messages. Thus, a rate of R\ = 3, R 2 = 3 can be achieved (asymptotically as 
T — > oo). Thus we obtain a sum-rate of 6 which is in fact the sum-capacity of this channel 
with cooperation. 

The above scheme has two kinds of signals: 

• Private signals: zi a , zu, z 2a , z 2 b are recovered only by the destination to which it is 
intended. Note that these signals occupy the lower levels of the transmitted vector 
such that they do not appear at the destination where they may act as interference. 

• Cooperative-public signals: vi,v 2 . These signals are read off by the other source at the 
end of each time t and then incorporated into the transmission by both the sources 
at the time t + 1. Thus, the transmission of these signals exploits the possibility 
of cooperation among the two sources. Destinations perform "backwards decoding." 
They recover the cooperative-public signals sent cooperatively starting from the final 
received vector and proceeding backwards. Hence, the initial transmission used by the 
sources to convey the signals to each other, being already available, does not act as 
interference at the destinations. In order to facilitate the recovery of these signals at 
the sources, they occupy the higher levels of transmitted vector. 

Example 2: Let us consider the following asymmetric linear deterministic case, 711,3 = 
6,^1,4 = 3,n 2j 4 = 4,77-2,3 — 3, and n c = 1. The capacity as given by Theorem [T] is 7. 
To achieve this, the sources transmit 



xi{t) 



Ul(t) + v 2 (t 

Zla(t) 
Zlb(t) 
\ Z le {t) 



1) \ 
1) 



and x 2 (t) 



( v 2 (t)+v 2 (t 
v x {t-l) 


z 2 (t) 





1) \ 



s 



Note that this is again possible because 



yi(t) 



\ 
o 

o 

V v 2 (t) + v 2 (t - 1) J 



and y 2 (t) 







V v^t) +v 1 (t- 1) J 



and hence, source 1 knows v 2 (t — 1) and source 2 knows v\(t — 1) before time t. The 
transmissions are again over a long block of length T with the signals at time T such that at 
time T all the u, v, z signals are 0. Also, we interpret ffc(0) = 0, k — 1, 2. Then, destination 3 
receives 



ys(t) = 



( v^ + v^t-i) \ 

u x {t)+v 2 {t- 1) 

^la(t) +U 2 (0 +U 2 (*- 1) 
*l&(f)+*>l(t- 1) 

v ^i c (0 y 



t = 1,2,...,T-1, andy 3 (T) = 



^i(T-l) \ 
v 2 (T-l) 


u 2 (T-l) 
«i(T-l) 




and destination 4 receives 



y*{t) 



o 



U 2 (0+ U 2(*-l) 

ui(t) + 2ui(*- 1) 
u^t) + v 2 (t - 1) 

Ul (t)+Z 2 (t) 



t= 1,2,..., T-l, andy 3 (T) 













W2 (T- 


1) 


2«i(T- 


-1) 


^ 2 (T- 


1) 



V 



o 



Now it is easy to verify that if the destinations read off the signals starting from the vectors 
they received at time T and proceeding backwards as in the previous examples, destina- 
tion 3 can recover {(v i(t), ui(t), zi a (t), zuit), z\ c (t), v 2 (t)) : t — 1, 2, . . . , T}, and destination 4 
{(v 2 (t), z 2 (t), v Ui(t)) : t — 1, 2, . . . , T}, and hence achieve a sum- rate of 7. 

This example involved a new type of signal apart from the private and cooperative-public 
types of the previous example. 

• Public signal: u±. This signal is decoded by both the destinations. However, note 
that unlike the cooperative-public signal, the other source does not participate in its 
transmission. Indeed, this signal is transmitted in such a way that it is not visible to 
the other source. 

Both the examples above involved cooperative links which are weaker than the direct and 
interfering links. The mode of cooperation involved the sources cooperating in aiding the 
destinations recover an interfering signal. When the cooperative link is strong, yet another 
possible form of cooperation becomes feasible. The next example illustrates this. 
Example 3: Let us again consider the symmetric case, but now with «o = 4, raj = 3, ric = 5. 
Note that, the cooperation link is now stronger than both the direct and interference links. 



9 



Without cooperation, the sum capacity is 5; but with cooperation, we will show that a sum 
rate of 6 can be achieved. The sources transmit 



Xl(t) 



( ui(t)+vi(f-l) \ 
v 2 (t-l) 

Si(f) 

zi(t) - s 2 (t) 

Sl(f+1) 



and X2(t) 



( v 2 (t) +v 2 (t- 1) \ 

Vi(t-1) 

s 2 (t) 
z 2 (t) - s 1 (t) 
s 2 (t + l) 



Note that this transmission scheme is possible since yi(t) = x 2 (t), y 2 (t) = X\(t). This allows 
the sources to exchange their s(t) signals one time step in advance over the lowest level. Also, 



we set Sfc(l) 
now receive 



0,Vk(T) = 0, and interpret Vk(0) = 0,Sk(T + 1) = 0, k — 1,2. Destinations 



i/sCO 



/ o 

ui(*)+ui(*-l) 
u 2 (i) + 2v 2 (t - 1) 

si(t)+«i(t-l) 

V 

/ o 
vi(r-i) 

2v 2 (T - 1) 
Sl (T)+ Vl (T-i: 



t = 1,2, ...,T- 1, and 



/ 



Recovery of signals proceeds backwards from the last received vector as in Example 1. As- 
suming that the characteristic of the field F is not 2, destination 3 can recover the signals 
{(vi(t), Si(t), Zi(t), v 2 (t)) : t — 1,2,... , T}. This gives a rate of R\ = 3, and by symmetry a 
sum-rate of 6. 

In addition to the previously encountered private and cooperative-public types of signals, 
another kind of signal plays an important role in this example. The strong cooperation link 
allows the sources to share with each other signals which are eventually only recovered by 
the destination it is intended for. i.e., 

• Cooperative-private signals: s\,s 2 . The sources learn these signals from each other one 
time step ahead, and in the next time step they cooperate with each other to convey 
these signals only to the destination it is intended for. Note that the collaboration 
between the sources in this example is a rudimentary form of precoding. The two 
sources can be thought of as two antennas of a broadcast transmitter when they co- 
operate to transmit these signals. Only the destination to which the signal is intended 
for recovers it and the precoding ensures that no interference is caused at the other 
destination. The main differences from cooperative-public signals are: (1) the sources 
convey the cooperative-private signals to each other through the lowest levels of their 
transmission vector in such a way that it is not visible to the destinations, whereas 
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the cooperative-public signals are conveyed over the top most levels of the transmis- 
sion vector, and (2) the source collaborate in sending the cooperative-private signals by 
pre-coding the signals to ensure that no interference results at the destinations, while 
cooperative-public signals are visible to both the destinations which end up recovering 
them. Thus, while the role of cooperation is to aid both the destinations in recover- 
ing the cooperative-public signals, it aims to conceal the cooperative-private from the 
destination it is not intended for. 

In general, such uncoded schemes are not sufficient to cover all possible linear determin- 
istic channels (indeed, even in the above example, we relied on the characteristic of the field 
not being 2), and more importantly the Gaussian channels. But the basic intuition can be 
used to build coding schemes which do. These schemes are presented in the next section. 

5 Coding schemes 

We first present our key coding theorem (in Theorem [4]). It is a block-Markov (in the 
sense of [H]) coding scheme which builds on Han and Kobayashi's classical superposition 
coding scheme [5] for the two-user interference channel and has elements of decode-and- 
forward strategy [H] and backwards-decoding [15] for relay channels. The schemes are 
generic in the sense that they apply to any memoryless interference channel with source 
cooperation Py 1 ,y 2< y 3 ,y 4 ,\x 1 ,x 2 • Then, we apply these schemes to the problems at hand to obtain 
the achievability part of Theorems [I] and [2] We would like to point out that Theorem Wa) 
is identical to the one which appears in [6]. 

Theorem 4 (a) Given a joint distribution pw PVuU^x^w Pv 2 ,u 2 ,x 2 \w, the rate pair (Ri,R 2 ) 
is achievable if there are non-negative ry x ,ry 2) r\j x ,ru 2 ,r XY ,r X2 such that R± = rv 1 +ru 1 +r Xl! 
R2 = r V2 + ru 2 + r X2 , and 

r Vl <I(V i; Y 2 \W) 

r Xl <I(X 1 ;Y 3 \V 1 ,V 2 ,W,U 1 ,U 2 ), 
r m + r Xl < I(U u X i; Y 3 \V U V 2 , W, U 2 ), 
ru 2 + r Xl < I(U 2 , Xi; Y 3 \V U V 2 , W, UJ, 
r Vl + r U2 + r Xl < I(U U U 2 ,X x] Y 3 \V 1: V 2 , W), 
(r Vl + r> 2 ) + r Vx + r U2 + r Xl < I{W, V u V 2 , U u U 2 , X x ; Y 3 ), 

and the corresponding inequalities with subscripts 1 and 2 exchanged, and 3 replaced with 4- 

(b) Given a joint distribution p w Pv^u^wPv^w Ps^wPs^w Pz x \wy^,s x Pz 2 \wy 2 ,v 2 ,s 2 Px 1 \w,v 1 ,u 1 ,z 1 ,s 1 ,s 2 

Px 2 \wy 2 ,u 2 ,z 2 ,s 1 ,s 2 , the rate pair (i?i, R 2 ) is achievable if there are non-negative r Vk ,r Uk ,r Zk , r Sk , k — 
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1, 2 such that R k = r Vk + r Uk + r Zk + r Sk , k — 1, 2 and 

r Sl <I(X 1 ;Y 2 \W,S 1 ,S 2 ,Z 1 ,U 1 ,V 1 ) 
r Zl + r Sl < I(Z U X i; Y 2 \W, S u S 2 , U u V 1 ) 
r Vl + r Zl + r Sl < I(U U Z u X,; Y 2 \W, S u S 2 , V 1 ) 
r Vl + r Vl + r Zl + r Sl < I(V 1: U u Z 1: X i; Y 2 \W, S 1: S 2 ), 

r Zl < I(Z 1 ;Y 3 \V 1 ,V 2 ,W,U 1 ,U 2 ,S 1 ), 
rv, + r Zl < I(U U Z i; Y 3 \V U V 2) W, U 2) S,), 
r Sl + r Zl < I(S U Z x - Y 3 \V U V 2) W, U u U 2 ), 
r Sl + r Vl + r Zl < I(S U U^Z,; Y 3 \V U V 2 , W, U 2 ), 
r U2 + r Zl < I(U 2 , Z x - Y 3 \V U V 2 , W, U u ft), 
r U2 + r Vl + r Zl < I(U 2 , U u Z x - Y 3 \V 1: V 2) W, SJ, 
ru 2 + r Sl + r Zl < I(U 2 , S 1: Z 1 ; Y 3 \V 1: V 2 , W, U,), 
ru 2 + r Sl + rUl + r Zl < I(U 2 , S u Ui, Z x - Y 3 \V U V 2 , W), 
(r Vl + r V2 ) + r Vl + r U2 + r Sl + r Zl < I(W, V x , V 2 , U u U 2 , S u Z x \ Y 3 ), 

and the corresponding inequalities with subscripts 1 and 2 exchanged, and 3 replaced by 4- 
(c) Given a joint distribution p w Pv 1 ,u 1 \wPv 2 ,u 2 \w Ps^w Pz 1 \w,v 1 ,u 1 ,s 1 Pz 2 \w,v 2 ,u 2 Px^wy^u^z^ 
Px 2 \w,v 2 ,u 2 ,z 2 ,s±, the rate pair (R±, R 2 ) is achievable if there are non-negative ry k ,ru k ,r Zk , k = 
1, 2, and r Sl , such that R 1 = r Vl + r Vl + r Zl + r Sl , R 2 = r Vl + r Ul + r Zl , and 

r Sl <I{X l] Y 2 \W,S l ,Z l ,U 1 ,V 1 ) 
r Zl +r Sl </(^i,X 1 ;y 2 |W,S'i,C/i,yi) 
r Vl + r Zl + r Sl < I(U 1: Z u X,; Y 2 \W, S 1: V 1 ) 
r Vl + r Vl + r Zl + r Sl < J(K, U u Z u X ± ; Y 2 \W, SJ, 

r V2 <I(V 2 ;Y 1 \W,S 1 ), 

r Zl < I(Z 1 ;Y 3 \V 1 ,V 2 ,W,U 1 ,U 2 ,S 1 ), 
ru, + r Zl < /(C/i, Z l5 Y 3 \V U V 2 , W, U 2 , SJ, 
r Sl + r Zl < I(S 1} Z x - Y 3 \V U V 2 , W, U u U 2 ), 
r Sl + r Vl + r Zl < I(S U U u Z x - Y 3 \V X , V 2 , W, U 2 ), 
ru 2 + r Zl < I(U 2 , Z i; Y 3 \V 1: V 2) W, U u S,), 
ru 2 + r Vl + r Zl < I(U 2 , U u Z x - Y 3 \V U V 2 , W, S,), 
ru 2 + r Sl + r Zl < I(U 2 , S u Z,- Y 3 \V U V 2 , W, U,), 
ru 2 + r Sl + r Vl + r Zl < I(U 2 , S u U u Z i; Y 3 \V ± , V 2 , W), 
(r Vl + r V2 ) + r Vl + r U2 + r Sl + r Zl < I(W, V u V 2 , U u U 2 , S u Z x - Y 3 ), 
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rz 2 <I(Z 2 ;Y 4 \V 1 ,V 2 ,W,U 1 ,U 2 ), 
r U2 + r Z2 < I(U 2 , Z 2 ; Y 4 \V U V 2 , W, U x \ 
rui + rz 2 < I{U X , Z 2 - Y 4 \V h V 2 , W, U 2 ), 
r Vl + r U2 + r Z2 < I{U X , U 2 , Z 2] Y 4 \V U V 2 , W), 
(r Vl + r V2 ) + r Vl + r U2 + r Z2 < I(W, V 1: V 2 , U x , U 2 , Z 2 ; Y 4 ). 

We prove this theorem in Appendix |Aj Here, we interpret these theorems in the context 
of the examples and discussion in the previous section: 

Scheme (a) involves the source nodes aiding their respective destination nodes decode part 
of the interference (the cooperative-public message from the interfering source) by essentially 
retransmitting the part of the interference observed by the source. In order to facilitate this, 
the cooperative-public message is coded separately (in V\ and V 2 ) which are decoded by the 
sources and retransmitted (in W). There are further public messages (U\ and U 2 ) which are 
not aided by the other source, and private messages as well. This scheme is closely related 
to Example 1 of the previous section. 

In scheme (b), in addition to the above, the sources collaborate in sending private mes- 
sages by sharing with each other in advance the part of the message on which they want to 
collaborate. Thus, the connection is to Example 3 of the previous section. The auxiliary 
random variables in scheme (b) have the following interpretation: 



Aux. Decoding Remarks 
RV destinations 



Vi 


3,4 


cooperative-public message from source 1 


v 2 


3,4 


cooperative-public message from source 2 


w 


3,4 


used by the sources to cooperatively send the two cooperative-public messages 


Ul 


3,4 


public message from source 1 


u 2 


3,4 


public message from source 2 


Si 


3 


carries the cooperative-private message to destination 3 


s 2 


4 


carries the cooperative-private message to destination 4 


Zi 


3 


private message from source 1 


z 2 


4 


private message from source 2 



Note that scheme (a) is not a special case of (b) as it might appear. The key difference is 
that while in scheme (b), the sources perform a joint decoding of all the messages sent by 
their counterparts (including the public and private messages which they do not aid in the 
transmission of), in scheme (a), only the part of the cooperative-public message meant to used 
for collaboration is decoded treating all the other messages as noise. At low strengths for the 
cooperative link, scheme (a) can perform better than scheme (b) specialized to not include 
a cooperative-private message. On the other hand, for strong cooperative links, adopting a 
joint decoding scheme at the sources can lead to better overall rates for other messages. 
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Scheme (c) combines these two schemes in a limited manner. Only source node l's trans- 
mission benefits from transmission of some cooperative-public and some cooperative-private 
messages while source node 2's transmission benefits only from collaborative transmission of 
some cooperative-public message. Source node 1 adopts a decoding strategy similar to that 
adopted by the sources in scheme (a) whereas source node 2's decoding strategy is similar 
to the one in scheme (b). The three schemes (a), (b), and (c) can be easily combined into 
a single scheme (at the expense of considerably more involved notation) which may improve 
the achievable region in general, but since the focus here is on the sum-rate of the Gaussian 
case (within a constant gap) and linear deterministic cases which are obtained by considering 
the schemes separately, we do not explore this here. 

Intuitively, a source employs a cooperative-private message only when the cooperative link 
is stronger than the direct link from this source, and the source shares this message with the 
other source in advance by having it "ride below" the other messages it sends. Thus, one of 
the differences of our scheme from the proposals of [7J [Sj is that when sharing the cooperative- 
private message in advance, the decoding source performs a joint decoding of all the messages 
including those messages it does not aid in the transmission of, rather than treat these other 
messages as noise. Note that in our schemes (b) and (c), the sources cooperate in sending 
the cooperative-private messages by employing a simple form of precoding along the lines of 
Example 3 in the last section. Use of dirty-paper coding [TBI [T7| instead may lead to an 
improved gap. 

The achievability proofs presented in Appendix [B] and [C] of our main theorems (Theo- 
rems [T] and [5]) make use of these schemes. 

6 Interference Channel with Feedback 

A closely related problem is that of the interference channel with feedback. Let us consider 
the symmetric Gaussian interference channel with noiseless feedback. As in the model we 
considered earlier, 

Y 3 (t) = hjXiit) + h 2 , 3 X 2 {t) + N 3 (t), 
Y 4 (t) = h 2 , 4 X 2 (t) + h XA X x {t) + N 4 (t). 

However, instead of receiving signals through the cooperation link, the sources now receive 
noiseless feedback from their respective destinations, i.e., 

Yi(t) = Y 3 (t), 
Y 2 {t) = Y A (t). 

As before, the transmissions from the sources are deterministic functions of their messages 
and their observations (here, the feedback received) in the past. Since the sources have access 
to the symbols they transmitted in the past, it is clear that the above problem is equivalent 
to one where the sources observe 

Y 1 (t) = h^X 2 (t) + N 3 (t), 
Y 2 (t) = h 1A Xi(t) + N 4 (t). 
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We can rewrite these as 



Y 1 (t) = h 2)1 X 2 (t)+N 1 (t), 
Y 2 (t) = h 1 , 2 X 1 (t) + N 2 (t). 

Here h 2) i = h 2j3 , hi )2 = hi^, Ni(t) = N 3 (t), and N 2 (t) = N^t). This is identical to the 
channel model stated at the beginning of section [2]except for the fact that now N\,N 2 , N$, N4 
are not independent, but jVj and A3 are identical, and so are A2 and A4. 

The above difference notwithstanding, we will argue below that the results presented in 



Section 3.1 on the sum-capacity of the interference channel with source cooperation have a 
bearing on this model as well. Note, however, that our source cooperation result was proved 
under the restriction that \hi }2 \ = \h 2 ,i\- In general, this may not hold true for interference 
channels with feedback, but a range of channels including, most importantly, the symmetric 
interference channel is covered. We will focus our attention on the symmetric channel, where 
|^i,3 1 = 1^2,4 1 = h D , and \h 1A \ = \h 2>3 \ = hi. 

Let us begin by noting that a linear deterministic formulation of the above feedback 
problem is identical to the one with source cooperation, and hence, Theorem [T] applies 
directly. Turning to the Gaussian case, let us note that the achievability proof for the source 
cooperation case depended only on the marginal distributions of the noises and not on their 
correlation. Hence the achievability proof also holds directly. We only need to argue that 
the converse also applies. In appendix [E], we will show that the biting upperbound for the 
symmetric channel with noiseless feedback indeed holds to give us the following proposition: 



Proposition 5 The sum- capacity of the symmetric, Gaussian interference channel with out- 
put feedback is within a constant (19 bits) of the minimum of the following terms: 

W(. + W + Ml(i + =^). M> 

This case was studied independently in [TU] which also characterizes the sum-capacity and 
obtains a better constant than we do here. 



7 Discussion 
7.1 Reversibility 

A related setting to the one studied in this paper is the interference channel with destina- 
tion cooperation. This case will be presented in a companion paper [18]. An interesting 
reversibility property connects the two settings. We briefly discuss it here. For brevity, the 
linear deterministic model will be employed. 

In the destination cooperation case, the destinations can not only receive, but they can 
also transmit. But these transmissions have to satisfy a causality constraint - the transmis- 
sions from each destination is a function of everything it has received up to the previous time 
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1 nifl 3 3 ^3 1 




(a) Source Cooperation (b) Destination Cooperation 



Figure 3: Source Cooperation and Destination Cooperation. Note that the destination 
cooperation channel (b) is obtained from the source cooperation channel (a) by: (i) reversing 
the roles of the sources and destinations, i. e., nodes 1 and 2 are sources in (a) and destinations 
in (b); nodes 3 and 4 are destinations in (a) and sources in (b), and (ii) reversing the directions 
of all the links while maintaining the link strengths to be the same. Both channels have the 
same sum-capacity. 

instant. In order to illustrate the reversibility between destination and source cooperation 
scenarios, we will number the nodes as shown in the Fig. [3](b) : 3 and 4 are the source nodes 
now which want to communicate to destination nodes 1 and 2 respectively, and there is a 
cooperation link between the destination nodes. The destination nodes receive 

Yx(t) = h 1:3 (X 3 (t)) + h* 1A (X 4 (t)) + h, 2 (X 2 (t)), 
Y 2 (t) = h 2A (X 4 (t)) + h* >3 (X 3 (t)) + M*i(*))- 

where the (deterministic) encoding functions at the sources are of the form 

X k (t) = f k , t (M k ), A; = 3,4, 

and the (deterministic) relaying functions at the destinations are of the form 

X k (t) = f k , t (Yt l ), k = 2,l. 

Let ni )3 , n 2j 3, 111,4, n 2 ^ n.2,1, be non-negative integers and n = f max(ni i3 , n 2i 3, rii j4 , n 2 ,4, n.2,1, 
The inputs to the channel X 3 and X 4 are n-length vectors over a finite field F. We define 



h 1>3 (X 3 ) 


= s n - 


-"1,3 V 

A 3 ? 


h 2A (X,) 


= s n - 


-"2,4 y 




= S n ' 


-"1,4 V 


^2,3(^3) 


= s n - 


-"2,3 y 


h, 2 (x 2 ) 


= s n - 


-"1,2 V 


h 2 , x {X x ) 


= s n - 


'^Xl 
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Further, to model the reciprocity of the links between the two receivers, we set 712,1 = ^1,2 = 
n c . 

The reversibility property is that the sum-capacity expression in Theorem [T] is also the 
sum-capacity of the above channel with destination cooperation. This turns out to be a 
feature of our achievable strategy which holds in more general cases (larger networks, more 
number of sources-destination pairs etc.) as discussed further in [19]. It would be interesting 
to investigate whether the optimality results presented here also extend to more general 
settings. 



7.2 Dependence on channel state information 

Throughout the paper we assumed that full channel state information is available at both the 
sources and the destinations. However, this can be relaxed. The application of the scheme in 
Theorem[4^a) only requires the sources to know the channel strengths to the two destinations, 
and not their phases. Note that Theorem ffl^a) caters to the case where the strength of the 
cooperative link is weaker than that of all the links to the two destinations. But, to apply 
the schemes in Theorem |4^b) and (c), we do require the sources to have full channel state 
information. This is not surprising since the analogous setting of a multiantenna broadcast 
channel also requires full channel state information. 



A Proof of Theorem 4 



(a) We present a block-Markov scheme with backwards decoding. Given PwPv 1 ,u 1 ,x 1 \wPv 2 ,u 2 ,x 2 \w, 
we construct the following blocklength-T codebooks: 

• W codebook: We create a W-codebook Cw of size 2 Trw with codewords of length n by 
choosing the elements independently according to the distribution p w . We will denote 
the codewords by cw{iti>w) where my/ e {1,. . . ,2 T ^- £ )}, where e > 0. 

• V codebooks: For each codeword cw{mw) £ an d for each k — 1, 2, we create 14- 
codebook Cy k (rn w ) of size 2 T< - rv fe~^ respectively, by choosing elements independently ac- 
cording to pv k \w i-\ w ) where w is set to the respective element of the cwijnw) codeword. 
These codewords will be denoted by cy k {m Vk ,m w ) where m Vk e {1, . . . , 2 T ( rv fc~ e )}. 
Moreover, we set 

rw = r Vl + 7y 2 . 

• U codebooks: For each codeword cy h (mv k , mw) , we create a £4-codebook Cjj k ijnv k > inw) 
of size 2 T ( ru k~ e > by choosing elements according to pu k \v k ,w{-\ v k,w) by setting Vk and 
w to be the respective elements of the Cy fc (my fc , mw) and cw{mw) codewords. 

• X codebooks: Finally, for each codeword cu k (rriu k ,mv k ,mw), we similarly create a 
X fc -codebook Cx k {mu k ,mv k ,mw) of size 2 T ^" £ ) using px k \u k ,v k ,w- 
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Encoding: For block-j, we will assume for the moment that the source nodes have successfully 
decoded the messages my k (j — 1). Then the encoding proceeds as follows. Both encoders set 
m w(j) — ( m v 1 (j ~ ^), m v 2 (j ~ !))■ They then proceed to choose the codewords cw(nT,w(j)), 
c v k (m Vk (j),m w (j)), c Uk {m Uk (j),m Vk (j),m w (j)), and c Xk (m Xk (j),m Uk (j),m Vk (j),m w (j)). 
The X-codewords are transmitted. For the first block, j = 1, we set m w (l) = 1 and for the 
last block J, we set my 1 (J) = my 2 (J) = 1. 

Decoding at the sources: At the end of block-j, source 1 will try to decode the message my 2 (j) 
from source 2 and vice versa. Using standard arguments, we can show that for joint-typical 
decoding to succeed (with probability approaching 1 as the blocklength n approaches oo), it 
is enough to ensure that 

r Vl <Iiy^Y 2 \W), and 
rv 2 ^Uy^Y^W). 

When this decoding fails, we will say that "encoding at the sources has failed at block- j," 
and declare an error. 

Decoding at the destinations: Destinations perform backwards decoding. We will assume that 
before destination 3 processes block-j, it has already successfully decoded mw(j + 1) = 
(my 1 (j) , my 2 (j)) . This is true for j = J since my 1 (J) = my 2 (J) = 1. And, for each j, we will 
ensure that from block- j, destination 3 decodes mw(j) successfully thereby ensuring that 
the above assumption holds true. Assuming that my 1 (j), my 2 (j) is available at destination 3, 
we will try to ensure that from block-j, the messages mw{j), mil! (j), m x 1 (j) are successfully 
decoded. In trying to decode these messages, destination 3 will also try to jointly decode 
the message mu 2 (j). It can be shown that a high probability of decoding success can be 
ensured (i.e., the probability of failure in decoding the messages mw(j), mu^j), and mx 1 {j) 
from what destination 3 receives in block-j assuming m w (j + 1) is available, goes to as 
blocklength n goes to oo) if the following conditions are met. 

r Xl <I(X 1 ;Y 3 \V 1 ,V 2 ,W,U 1 ,U 2 ), 
r Vl + r Xl < I(U U X i; y 3 |Vi, V 2 , W, U 2 ), 
r U2 + r Xl < I(U 2 , X i; Y 3 \V!, V 2 , W, Ui), 
r Ul +r U2 +r Xl < I(U U U 2 , X^V,, V 2 ,W), and 
(r Vl + ry 2 ) + r Vl + r Ua + r Xl < I(W, V u V 2 , U u U 2 ,X 1 ; Y 3 ). 

A similar set of conditions ensure success of decoding at destination 4. If decoding fails 
for block-j for either of the destinations, we will say that "decoding failed at block-j" and 
declare an error. 

Overall, an error results if for at least one block-j, either encoding fails or decoding fails. 
Since there are a finite number J of blocks, by union bound, the above discussion implies that 
the probability of error goes to as the blocklength goes to oo when the above conditions 
are met. This completes the random coding argument. 

(b) We present a block-Markov scheme with backwards decoding at the destinations. Given 

PW PVi,Ui\W PV 2 ,U 2 \W PSi\WPS 2 \W PZ!\W,Vi,Ui,Si PZ 2 \W,V 2 ,U 2 ,S 2 PXi|W,Vi,C/i,Zi,5i,52 Px 2 \W,V 2 ,U 2 ,Z 2 ,Si,S 2 , 

we construct the following blocklength-n codebooks: 
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• W, V, and U codebooks: These codebooks are constructed as in scheme (a) above. 
We create a jy-codebook Cw of size 2 Trw with codewords of length n by choosing the 
elements independently according to the distribution pw We will denote the codewords 
by c w (m w )i where m w G {1, . . . , 2 T ( rw ~ e ' ) }, where e > 0. 

For each codeword cy/ijnw) £ C>w, an d f° r each k = 1,2, we create Vfc-codebook 
Cv k (mw) of size 2 T ^ Vv k~^ respectively, by choosing elements independently according 
to p Vk \w{-\ w ) where w is set to the respective element of the cw{mw) codeword. These 
codewords will be denoted by cv k (mv k , m w) where my k G {1, . . . , 2 T ^ rv k~ 2 ^}. Moreover, 
we set 

rw = r Vl +r V2 . 

For each codeword cv k (mv k ,mw), we create a C4-codebook Cu k ( m v k , mw) of size 
2 T ( Vu k~ e } by choosing elements according to Pu k \v k ,w(-\ v k, w ) by setting vj- and w to 
be the respective elements of the cv k (m Vk ,mw) and cwijnw) codewords. These code- 
words will be denoted by cu k (mu k ,iv k , m w) where m Uk G {1, . . . ,2 T( - ru k~^}. 

• S codebooks: For each cw(mw) £ Cw, an d for each k — 1, 2, we create an Sfc-codebook 
Cs k {m w ) of size 2 T(rs fc~ e ) respectively, by choosing elements independently according 
to ps k \w{-\w) where w is set to the respective element of the cwijnw) codeword. These 
codewords will be denoted by c Sk (m Sk ,m w ) where m Sk G {1, . . . , 2 T(rs fc~ 2e )}. 

• Z codebooks: For each pair of codewords (cu k (mu k , m^, mw), cs k (ms k , mw)) £ 
Cu k (m Vk ,m w ) xCs k (m w ), and for each k — 1,2, we create a Z fc -codebook 
Cz k (rnu k ,mv k ,mw,is k ) of size 2 T(rz fc~ e ) respectively, by choosing elements indepen- 
dently according to Pz k \wy k ,u k ,s k (-\w, v k , u k , s k ) where w, v k , u k and s k are set to the re- 
spective elements of the c w {m w ), c Vk (m Vk ,m w ), c Uk (m Uk ,m Vk ,m w ), and c Sk (m Sk , m w ) 
codewords, respectively. The codewords so generated will be denoted by 
c Zk (m Zk ,m Uk ,m Vk ,m w ,m Sk ) where i Zk G {1, . . . , 2 T(rz fc" e) }. 

• X codebooks: Finally, consider pairs of codewords 

(c Zk (m Zk ,m Uk ,m Vk ,m w ,m_ Sk ),c Sk (m Sk ,m w )) G C Zk (m Uk ,m Vk ,m w ,m Sk ) x C S - k (m w ), 
for each k — 1,2, where k — 2, if k — 1, and k — 1, if k — 2. For each pair, we 
create a X fc -codebook Cx k (m Zk ,m Uk ,mv k ,m w ,is k ,is k ) of size 2 T( - rs k~^ by choosing el- 
ements independently according to px k \w,v k ,u k ,z k ,s k ,s k (-\w,v k ,u k ,s k ,s k ) where w, v k , 
Uk, %k, Sk, and s k are set to the respective elements of the cwimw), c v k {iv k , m w), 
c Uk (i Uk ,i Vk ,m w ), c Zk (i Zk ,i Uk ,i Vk ,m w ), c Sk (is k ,m w ), and c S - k (is- k ,m w ) codewords, re- 
spectively. The codewords so generated will be denoted by 

cx(mpvT-coop k ,rn Zk ,rn Uk ,rn Vk ,rn w ,i Sk ,is k ) where m PV T-coop fc e {1, • • • , 2 T(rs ^ e) }. 

Encoding: For block- j, we will assume for the moment that the source nodes have success- 
fully decoded the messages rny k {j — 1), mpvT-coop fc ( j — 1)- Then the encoding proceeds 
as follows. Both encoders set mw(j) = ( m v 1 (j _ l)? m v 2 (i ~~ !))■ Then, encoder-/c pro- 
ceeds to select the codewords c w (m w (j)), c Vk (m Vk (j),m w (j)), c Uk (m Uk (j),m Vk (j),m w (j)), 
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c s k (m Sk (j),m w (j)), and cz k (mz k (j),rn Uk (j),m Vk (j),m w (j),i Sk (j)). It transmits the X- 
codeword 

cx(rn P vT-coop k U),rn Zk (j),m Uk (j),m Vk (j),m w (j),m S -^ For the first block, j = 

1, we set mi4/(l) = m Sl (l) = m S2 (l) = 1 and for the last block J, we set m Vl (J) = my 2 (J) = 

"^PVT-COOPi(^) = ^PVT-COOP 2 (-0 — 1- 

Decoding at the sources: At the end of block-j, source 2 will try to jointly decode 

m Vl (j) , m Vl (j) , m Zl (j) , ^pvt-coopx (j) 

from source 1 and vice versa. Note that both sources have access to the W, S±, and S2- 
codewords. For joint-typical decoding to succeed (with probability approaching 1 as the 
blocklength T approaches 00), we can show that it is enough to ensure that 

r Sl <I(X 1 ;Y 2 \W,S 1 ,S 2 ,Z 1 ,U 1 ,V 1 ) 
r Zl +r Sl <I(Z 1 ,X 1 ;Y 2 \W,S 1 ,S 2 ,U 1 ,V 1 ) 
r Vl + r Zl + r Sl < I(U 1: Z u X x ; Y 2 \W, S u S 2 , V,) 
r Vl + r Vl + r Zl + r Sl < /(K, U u Z u X x] Y 2 \W, S 1: S 2 ). 

When this decoding fails, we will say that "encoding at the sources has failed at block-j," 
and declare an error. 

Decoding at the destinations: Destinations perform backwards decoding. We will assume 
that before destination 3 processes block-j, it has already successfully decoded m w (j + 1) = 
(m Vl (j) , mv 2 (j)) . This is true for j = J since m Vl (J) = m v , 2 (J) = 1. And, for each j, we will 
ensure that from block-j, destination 3 decodes mw{j) successfully thereby ensuring that 
the above assumption holds true. Assuming that mv 1 (j), % 2 (j) is available at destination 3, 
we will try to ensure that from block-j, the messages mw(j), m Ui(j), m z 1 (j), and ms 1 (j) 
are successfully decoded. In trying to decode these messages, destination 3 will also try to 
jointly decode the message mu 2 (j). The decoding is performed by looking for a unique tuple 
of W, Vi, V 2 , U 2 , Ui, Zi, Si codewords consistent with the information already known (namely, 
mv 1 {j) , mv 2 {j)) and which are jointly (strongly) typical with the (T-length) block of signal 
Y 3 received by destination 3 corresponding to the block-j. Using standard arguments, a high 
probability of decoding success can be ensured (i.e., the probability of failure in decoding the 
messages m w (j), m^j), and m Zl (j) and isi(j) fr° m what destination 3 receives in block-j 
assuming m w (j + 1) is available, goes to as blocklength T goes to 00) if the following 
conditions are met. 

r Zl < I(Z 1 ;Y 3 \V 1 ,V 2 ,W,U 1 ,U 2 ,S 1 ), 
ru, + r Zl < I(U 1: Z 1 ;Y 3 \V 1: V 2) W, U 2 , S ± ), 
r Sl + r Zl < I(S 1} Z x - Y 3 \V U V 2 , W, U u U 2 ), 
r Sl + r Vl + r Zl < I(S U U 1: Z x - Y 3 \V 1: V 2 , W, U 2 ), 
r U2 + r Zl < I(U 2 , Z 1 ;Y 3 \V 1: V 2 , W, U u S ± ), 
r U2 + r Vl + r Zl < I(U 2 , U u Z i; Y 3 \V U V 2 , W, S 1 ), 
r U2 + r Sl + r Zl < I(U 2 , S 1 , Z x - Y 3 \V 1: V 2 , W, U 1 ), 
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ru 2 + r Sl + r Vl + r Zl < I(U 2 , Si, U 1: Z l5 Y 3 \V 1: V 2 , W), and 
(r Vl + r V2 ) + rUl + r U2 + r Sl + r Zl < I(W, V u V 2 , U u U 2 , S u Z x - Y 3 ). 

A similar set of conditions ensure success of decoding at destination 4. If decoding fails 
for block-j for either of the destinations, we will say that "decoding failed at block-j" and 
declare an error. 

For the purposes of illustration, let us see how one of the above conditions is arrived 
at. One of the possible error-events under which decoding at destination 3 fails is when, for 
block-j, only the following decoding errors occur: m Zl ^ Tn Zl , mu 2 m u 2 , an d rrisi m s l - l 
where the"s indicate the decoded values. The probability of this (under random coding as 
described above) is 



E 



P \{c w , c Vl , cy 2 , d/i , c Zl , c Sl , c U2 , Y 3 T ) e T 6 



mz, ,mu ,m s . 



m w ,m Vl , m V2 , , m U2 , m Zl , m Z2 , 



mSi , ?-S 2 , "^PVT-COOPi , ^PVT-COOP 2 J ) 

where we suppressed the indices for the codewords, and the time-index j for the codeword 
indices of the conditioning event. The unhatted codewords have indices from the messages 
of the conditioning event, while the hatted codewords are short-hand notations for the code- 
words with the corresponding indices replaced by their hatted forms: 

c Zl = c Zl (m Zl ,m Ul ,m Vl , m w , m Sl ) , 
c Sl = c Sl (m Sl ,m w ), and 
cu 2 = cu 2 (m U2 ,mv 2 , m w ) . 

We also suppressed the subscript for the 5-typical set T W y 1 y 2jUlU2ZlSlY3 - We will continue 
to do that in the sequel; the appropriate subscripts will be clear from the context. Below, 
we will also suppress the conditioning event. Then, 



(c w , c Vl , c V2 , c Vl , c Zl , c Sl , c U2 , Yl) e T s 

< E P (( C ^> C Vi,C V2 , CuJ = (C W , C Vl , C V2 , Cf/J) 



P \(c w , c Vl , c V2 , c Ux , c Zl , c Sl , c U2 , Y 3 T ) e 



Cw , Cy 1 , Cy 2 , Cu-J — [Cw, Cy x , Cy 2 , Cu x , 



Further, 

P {(Cw, CVi , Cy 2 , C Vl , C Zl , C Sl , C U2 , Kj T ) G T 5 (c W , C Vl , Cy 2 , C Vl ) = (c W , C Vl , Cy 2 , C Vl )) 

= P ((CZl , C Sl , C U2 , Kj T ) = (c Zl , C Sl , CU 2 , ^T) ( C ^, «Vi , CV 2 ,C Vl ) = (C W , C Vl , Cy 2 , Cj/j) , 

(cz 1 ,c Sl ,J (72 ,?3 T )6r 5 
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where T s in the summation index is the set of conditionally 5-typical Z\, Si, U2, Y 3 sequences 
conditioned on the (W, Vi, V 2 , C/i)-typical sequence (cw, 5vi, £y 2 , Q/J. Note that the cardinal- 
ity of this set is upperbounded by 2 T ( H{ - z ^ Sl > U2 > Y;i \ wy ^ V2 ' u ^ +6 \ 



< 2~T(H(Z 1 ,S 1 ,U2\W,Vi,V2,Ui)-S)2-T(H(Y 3 \W,V 1 ,V2,U 1 )-5) 

where the first step follows from the independence of the hatted-codewords and what des- 
tination 3 receives conditioned on the unhatted-codewords. Combining everything, we can 
conclude that the probability of the error-event under consideration is less than or equal to 



which goes to zero as the blocklength T goes to 00 if we choose < 5 < e, and the rates 
satisfy the condition 



Overall, an error results if for at least one block- j, either encoding fails or decoding fails. 
Since there are a finite number J of blocks, by union bound, the above discussion implies that 
the probability of error goes to as the blocklength goes to 00 when the above conditions 
are met. This completes the random coding argument. 

(c) This scheme is a combination of the two schemes above. Now, only destination 3 receives 
a private message sent cooperatively by the two sources. Hence, only an Si codebook is 
present and there is no S2 codebook. The W, V and U codebooks are exactly as in (a) and 
(b). The Si codebook is identical to scheme (b). The Z and X codebooks are also similar 
and differ only in that the distribution used to construct them has no S2, and there is no 
fftpvT-coop 2 - Hence, we may set Z 2 = X 2 , and set X 2 codeword to be identical to the Z 2 
codeword. 

The encoding at node 1 proceeds exactly as in scheme (b) above except that, since there 
is no S2 codeword, node 1 need only decode the ^-codeword. Unlike in scheme (b), node 1 
treats all the other codewords from node 2 as noise when decoding the V 2 codeword. Thus, 
the only condition imposed by decoding at node 1 is 



Encoding at node 2 is exactly as in scheme (b) except for the fact that there is no S2 
codeword. Decoding at the destination 3 is identical to that in scheme (b) while that at 
destination 4 is identical to that in scheme (a). 



P ((czi, c Sl , c U2 , Y" 3 T ) = (c Zl ,c Sl ,c U2 , Y3 71 ) (c w , c Vl ,c V2 , c Vl ) = (c w , c Vl ,c V2 ,c Vl 

= F ((c Zl ,C Sl ,C U2 ) = (cz^Cs^Cu^Cw^v^Cv^Cu^) = (c^Cy^Cy^QyJ) • 




2T(r U2 +r Sl +r Zl -I(U2,S 1 ,Z 1 ;Y 3 \W,V u V 2 ,U 1 )-3e+3S) 



ru 2 + r Sl + r Zl < I(U 2 , Si,Z x - Y 3 \W, V u V 2 , U ± ). 



r V 2<I{V2\Yi\W,Si). 
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B Proof of achievability of Theorem [T] 



If we fix n 13 , n lj4 , n 2)3 , and n 2j4 , and consider the u^s in ([Tj)- ([4]) as functions of n c , the 
sum-rate expression in Theorem [T] (as a function of nc) breaks up into four natural regimes. 
We use different strategies to achieve the sum-capacity in different regimes. The regimes 
are: 

(i) nc < n m[n = f min(rai j 3, rii j4 , ra 2 ,3, ^2,4). It can be shown that for nc > n min , 

Ui{n c ) > min(M2(n c .),M3(n c ),M4(n c ),M 5 ). 

Hence, we need consider u\ only in the regime nc < ^min- Moreover, in this regime, 
■u 2 (nc) through « 4 (nc) are constants (i.e., they do not depend on nc and their val- 
ues are the same as when nc = 0). Since U\(nc) is monotonically increasing in nc, this 
means that we need to employ cooperation only when Wi(0) < min(-u 2 (0), w 3 (0), -u 4 (0), w 5 ), 
i.e., when 

max(n li3 - n 1A , n 2j3 ) + max(n 2)4 - n li4 , n 1)4 ) 

< min(max(ni i3 ,n 2i3 ) + (max(n 2>4 , n 2)3 ) - n 2)3 ) , 

max(n 2i4 , n M ) + (max(ni j3 , n lj4 ) - n M ) , ni j3 + n 2j4 ). (13) 

When the above condition is not true, the sum-rate expression reduces to the sum- 
capacity without cooperation. We show below how Theorem |4|a) implies that the 
sum-rate expression is achievable in this region, both when cooperation is required and 
not. 

(ii) n min < nc < min(rii i3 , n 2]4 ). In this regime, we can observe that the sum-rate expres- 
sion takes on a constant value since u 2 (n c ), u 3 (n c ), and u^ijic) are still constants. 
Hence, the achievability here is implied by the achievability in regime (i). 

(iii) min(ni j3 , n 2j4 ) < nc < max(ni j3 , n 2i4 ). In this regime, we use Theorem |4]^c). 

(iv) max(rzi i3 , rz 2j4 ) < nc- The sum-capacity is achieved in this regime by applying Theo- 
rem |4^b). 

For integer q satisfying 1 < q < n, we define 

T q = {x G F" : x t = 0, % < q} , 

i.e., all vectors in ¥ q such that their components in the range 1, . . . ,q are zeros. We take the 
indexing of the elements of vectors to start from the top as usual. For example, for binary 
field and n = 4, 

01 r "I r 1 roi^ 

I 

' ' 1 ' 1 [' 

J L 1 J L J L 1 J J 




2.3 



Regime (i): When the condition (|13j) holds, we consider a restricted regime of nc where 

ui(n c ) < mm(u 2 (nc),u 3 (nc),u 4 (nc),u 5 ). 
Since u\(nc) is monotonic in nc, it is enough to prove achievability in this regime to obtain 



achievability in regime (i) when (13) holds. We make the following choices for the auxiliary 
random variables in Theorem a): W, V\ 1 Ui, Zi, V 2 , U 2 , Z 2 are independent of each other and 
uniformly distributed over their alphabets which are as follows - V\, V 2 G F n , Ui,U 2 G T nc , 
Z\ G T nxA -, and Z 2 G T n2 - A - W is independent of all these and has the same cardinality as 
(Vi,V 2 ). X\ and X 2 are defined as 

X 1 = V X + U 1 + Z X , 
X 2 = V 2 + U 2 + Z 2 . 

This defines PwPv 1 ,u 1 ,x 1 \wPv 2 ,u 2 ,x 2 \w ■ These choices are such that destination 4's observa- 
tion does not depend on the "private" signal Zi, and, similarly, destination 3's observa- 
tion does not depend on Z 2 . With these choices, the conditions on the non-negative rates 
rv 1 ,r V2 ,r Ul ,r U2 ,r Zl ,r Z - 2 after removing redundant conditions are 

rvi < n c 

Tz x < [ni,3 - n M ], 
ru! + rzj. < ni )3 - n c , 
r U2 + r Zl < max(ra lj3 - n 1A , n 2 , 3 - n c ), 
rut + r U2 + r Zl < max(ni j3 - n c , n 2;i - n c ), 
rvi + rv 2 + r Ux + r U2 + r Zl < max(ni j3 , n 2i3 ), 

and the corresponding inequalities with subscripts 1 and 2 exchanged, and 3 and 4 exchanged. 
Further, we make the following choices for the rates. 

rxi = [ni j3 - ni i4 ]+, 
rz 2 = [n 2A ~ n 2>3 }+, 
rvi = r v 2 = n c , 

= max(n 2 ,4 - n 2 ,3, ^1,4 - n c ) - [n 2 ,4 - ^2,3] + , and 
r U2 = max(ni i3 - n M , n 2 , 3 - n c ) - [n 1>3 - n lj4 ] + , 

where [x] + = max(x,0). It can be shown that under the restricted regime of nc, these 
choices satisfy all the conditions above. The resulting sum- rate is U\{nc) as required. 



When condition (13) does not hold, as we mentioned earlier, it is enough to prove that 
the sum-rate at nc = is achievable. We apply Theorem [it a) where we set W, Vi, V 2 to be 
absent, and Ui, U 2 , Z x , Z 2 to be independent and uniformly distributed over their alphabets. 
Ui, U 2 G F n , Zi G f ni4 , Z 2 G -F n2 3' anc ^ -^1 anc ^ "^2 are defined as 

X 1 = V 1 + U 1 + Z 1 , 

x 2 = v 2 + u 2 + z 2 . 
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The conditions on the non-negative rates ru 1: ru 2 , r Zl , Tz-% after removing redundant condi- 
tions are 

r Zl < [ni, 3 -n M ], 
rui + r z 1 < n lt3 , 

ru 2 + r z L < max(ni i3 - n M , n 2 , 3 ), 
tux + r u 2 + r z 1 < max(n 1)3 , n 2j3 ), 

and the corresponding inequalities with subscripts 1 and 2 exchanged, and 3 and 4 exchanged. 
Applying Fourier-Motzkin elimination, we can show that a sum-rate of 
min(-ui(0), ^2(0), ^3(0), 7x4(0), M5) is achievable. 

Regime (Hi): Without loss of generality, let us assume that 7^3 < nc < 7i2, 4 . We will apply 
Theorem |2](c) in two different ways to show achievability in this regime. The first application 
covers (1) n 2)4 > 71^4, while the second covers (2) n 24 < n 14 . 

For case (1), n 24 > 711,4, we use the following choices for the auxiliary random variables: 
U\ is a constants. U 2 , Vi, V 2 , Z\, Z 2 , 61,3, S^ 3 , and S[ are chosen to be independent and 
uniformly distributed over their alphabets. The alphabets are: V%,V2 G F n , Z\ G J r niA , 

^26^ ri 2>3 5 <Sl,3 G F , Si G .T 7 max(ni,3, "1,4); S 2,3 e ^2,4 and C/2 e ^n' , where n' c (n' < n c ) is 
to be specified. W is independent of all these and has the same cardinality as (Vi, V 2 ). We 
define X\ and X 2 as follows 

X x = V 1 + Ui + Z x + 5i, 3 + 5-;, 

X 2 = V 2 + U 2 + Z 2 - S n ^- n ^S li3 + 5^3 

with the result that 

y 3 = S »-"i.3(F 1 + Zi) + S n -™ 2 ' 3 (^ 2 + U 2 ) + 5! 

r 4 = s n ~ n ^(v 2 + u 2 + z 2 ) + s n - na ' s (Vi), 

where we defined 

<^ 4£f ^gn— ni,3 gn— n2,3+n2,4— ni,4^^ ^ _|_ g"-«.2,3g-L 

We set ru x = and the conditions on the other rates (after removing the redundant ones) 
are 

r Sl < [nc ~ max(7ii >3 ,7ii )4 )] + 
rvx + r Ux + r Zl + r Sl < n c , 

rv 2 < n' c , 

r Zl < [ni, 3 - n 1A ] + , 
ru 2 + r z 1 < max([ni, 3 - ^1,4] + , n 2 ,3 - n' c ), 

r +r < f max([ni,3 -n 1A ] + ,n 2j3 - (n 2 , 4 -ni, 4 )), if ni j3 + n 2 ,4 ^ ni, 4 + n 2 , 3 
r Si r z 1 — \ maxQni^ — 711,4]+, n 2) 3 — 712,4), otherwise, 
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r + r +r < I max ([ n i>3 - n i,4]+, ^2,3 - n' c , n 2:3 - (n 2 ,4 - "1,4)), if "1,3 + n 2,4 n \A + n 2,s 
ru 2 r s 1 rZl - { m&x([n li3 - n 1A ] + ,n 2j3 - n' c ,n 2t3 - n 2A ), otherwise, 

(r Vl + r V2 ) + r U2 + r Sl + r Zl < max(ni 53 , n 2 , 3 ), 

rz 2 < ["2,4 - "2,3]+, 
ru 2 + r z 2 < max([n 2j4 - n 2>3 ]+, n 2A - n' c ), 
OVi + r Va ) + r U2 + r z , 2 < max(n 2j4 , n lA ). 

By Fourier-Motzkin elimination, we may conclude that an achievable R\ + R 2 is given by 
the smaller of u 2 (n c ), u 3 (n c ), u^nc), u 5 and 

[n c - max(»i [3 , "1,4)]+ + max(n 2 , 4 , n 1A ) + max([ni, 3 - n 1A ] + , n 2fi - n' c ) + n c + n' c + [n 2A - n 2 , 3 ] 4 

2 

The last term above can be shown to be not smaller than the minimum of u 2 (nc), u 3 (nc), u±{nc) 
and m 5 if n' c is chosen to be such that u\(n' c ) = mm(u 2 (n c ) , u 3 (n c ) , u^ric) , u 5 ) if «i(0) < 
min(w 2 (0), u 3 (0), u 4 (0), U5), and n' c = otherwise. Note that from earlier discussion, we 
know that this choice of n' c must be less than or equal to nc- 

For (2) n 2A < 711,4, we apply Theorem [3|c) with the same auxiliary random variables 
W, Ui, U 2 , Vx, V 2 , Zi, Z 2 , S 23 , S[ as in case (1) above. But instead of Si ;3 , we define S 2:3 
which is independent of all these random variables and distributed uniformly over ¥ n . We 
define X\, and X 2 as follows 

x x = v 1 + z 1 + s n ^- n ^s 2 , 3 + S[, 

X 2 = V 2 + U 2 + Z 2 — S 23 + S 23 



with the result that 



y 3 = s n - ni - 3 (yi + z x ) + s n " n2 - 3 (V2 + u 2 ) + St, 

y 4 = S n - n2 - 4 (V2 + U 2 + Z 2 ) + S n - ni - 4 (Vi), 

where we define S\ as 

4£. f ^g«-«l,3+™l,4-n.2,4 _ Q n ~ n2 ' 3 ^S 2 3 + S n_n2 ' 3 

The conditions on the rates (after removing the redundant ones) are the same as in case (1) 
except for the following two 

r +r < f max([ni )3 - ni, 4 ]+, n 1)3 - (n 1A - n 2)4 ), n 2)3 - n 2 , 4 ), if ni )3 + n 2j4 7^ ni, 4 + n 2 , 3 , 
r 5i r Zi _ I m ax([ni )3 — "1,4] + , n 2>3 — n 2)4 ), otherwise, 

?"[/ 2 + rs a + r Zl 

< f max([72i i3 - ni j4 ] + , n 2>3 - n' c , n lj3 - (n 1A - n 2A ),n 2 - 3 - 712,4), if "1,3 + «2,4 7^ "a,4 + n 2 , 3 , 
~ 1 max([7ii )3 - 711,4]+, n 2>3 - n' c , n 2)3 - n 2 , 4 ), otherwise. 
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By Fourier-Motzkin elimination, we may conclude that the achievable R\ + _R 2 is given by 
the smaller of u 2 (n c ), u 3 (n c ), u^ric), u 5 and 

[n c - max(ni i3 ,n M )] + + max(n 2 , 4 , n M ) + max([rei, 3 - n M ] + ,n 2;3 - n' c ) + n c + n' c + [n 2 , 4 - n 2 , : 

2 

The above term can be shown to be not smaller than the minimum of w 2 (^c)? u z{ n c)i U4(%) 
and M5 if n' c is chosen as was done in case (1) above. 

We can represent the two cases above together using the following notation: 

x 1 = v 1 + z 1 + s 1)3 + s[, 

where 



5i, 



S 



2,?, 



Sl,3, 

S ni > 4 - n2 ' 4 S 2i3 , 

_s^.4-"i.*s 1)3 + s^ 3 , 

— S 2;3 + S 2 3, 



if ™2,4 > ™1,4, 

otherwise, 

if n 2 , 4 > ^1,4, 
otherwise, 



with the result that 



r 3 = s^^iYx + z x ) + s n ~ n ^{v 2 + u 2 ) + s l5 
f 4 = s n ~ n ^(v 2 + u 2 + z 2 ) + s n - ni -*(yi), 



where 



Si 



drf f (S n " ni - 3 - S n - n2 - 3+n2 - 4 - ni - 4 )S li3 + S^^S^, if n 2>4 > n 1)4 , 
~ j ( S n-ni,3+m,4-n 2 ,4 _ S n ~ n2 ^)S 2i3 + S n - n2 . 3 S/ 3 , otherwise. 



We will use a symmetric form of this notation below for regime (iv). 

Regime (iv): Application of Theorem U\b) with the following auxiliary random variables 
covers this regime: U\, U2 are constants. ]/]_, V 2 , Zi, Z 2 , Sl j3 , S 2)3 , Si )4 , S 2] 4, S^g, Sf 4 , S i; S 2 are 
chosen to be independent and uniformly distributed over their alphabets. Also, we choose 
the alphabets to be V 1 , V 2 , S 1>3 , S 2)3 , S M , S 2i4 G F n , ^ G ^„ 1>4 , Z 2 G S^ 3 G J" n2 4 , 

S^ 4 G JF ni 3 , Si G J r roa x(rn 1 3,fn,4), and S 2 G ^(^4,^,3). W is independent of all these and 
has the same cardinality as (Vi, V 2 ). Further, we define X\, and X 2 as follows 

x x = Vi + u x + z x + Si, 3 + Si, 4 + s; 

A 2 = + ?7 2 + Z2 + S 2j 3 + S 2j 4 + S 2 , 
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where 



5*1,3 = 

5 2 .3 z 

5 2 .4 z 
5*1,4 = 

The upshot of this is that 



51,3, if ^2,4 > ^1,4, 

Sni,4-n 2 ,4 j 5 2)3j otherwise, 

-S n ^~ n ^S h3 + % if n 2)4 > n M , 
-5 2 ,3 + 5^, 

5*2,4, 

S n 2 ,3-m,3S M) 



otherwise, 

if ni )3 > n 2 ,s, 
otherwise, 



-S 



Hl,3 _ "2,3 



5 2 ,4 + 5^, if ni j3 > n 2 , 3 , 



■5i,4 + 5 lj4 , 



otherwise. 



Y 3 = S n - ni - s (Vi + Zi) + S n -" 2 > 3 \/ 2 + 5i 

(y 2 + z 2 ) + s n - na - 3 v 1 + s 2 , 



Ya 



jn-n 2 ,4 



where 



5i 



def 



def 



(gn-m.3 _ S n-n2, 3 +n2,4-n 1 , 4 ^ i 3 + gn-n 2 , 3 gO^ jf ^ > ^ 

( S n-m, 3 +ni,4-n2,4 _ S n - n2 .«),S 2 , 3 + S n ~ n2 - 3 S^, otherwise, 

( S n-n 2 , 4 _ gn-ni,4+ni,3-n a , 3 ) j 5 2)4 + S™- 4 ^, if n 1>3 > 712,3, 

( S n-n a ,4+n a ,3-ni, 3 _ S"- ni > 4 )s 14 + S" - " 1 - 4 ^, otherwise. 



With these choices, the conditions on the non-negative rates r Vl , ry 2 , r Sl , r S2 , r Zl , r Z2 after 
removing redundant conditions are 

rsi < [nc - max(n lj3 ,n li4 )] + , 
rvi + r Zl + r Sl < n c , 



r Zl < [ni >3 -n M ]+, 
+ r Zl < max([ni >3 - 711,4]+, n Sl ), 
(r Vl +r V2 )+ r Sl + r Zl < max(n li3 , n 2 , 3 ), 

and the corresponding inequalities with subscripts 1 and 2 exchanged, and 3 replaced by 4, 
where 

' max(ni, 3 , n 2 , 3 - (n 2 , 4 - n M )), if n 2 , 4 > n M and ni, 3 + n 2 , 4 ^ n 1A + n 2 , 3 , 
n Sl = < max(n 2 , 3 , ni, 3 - (n M - n 2 , 4 )), if n 2 , 4 < ni, 4 and ni, 3 + n 2 , 4 7^ 7ii j4 + n 2 , 3 , 
k (712,3 ~ n 2A )+, if ni, 3 + n 2 , 4 = ni, 4 + n 2 , 3 . 

We may apply Fourier-Motzkin elimination to obtain the sum-rate supported by this scheme. 
We get a sum-rate which is the minimum of u 2 (nc), ^3(^0), u±{nc), «5 r This completes the 
achievability proof. 
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C Proof of achievability of Theorem [2 

We prove Theorem [2] using Theorem |4| Note that we proved the latter for discrete alphabets, 
but the extension to the continuous alphabet case is standard and we will assume that version 
in this section. This proof will follow the proof of Theorem [T] closely. We first make the 
following definitions: 

"fci.te — [l°g|^fci,fe| 2 ]+> h,k 2 G {1,2,3,4}, and 
n c = [\og\h c \ 2 ] + . 

First, we observe that the following four terms Ui,u 2 ,u 3 , and u' 4 are within a constant (5 
bits) of the corresponding unprimed terms, Ui,u 2 ,u 3 , and w 4 , respectively 

u[ = max(n li3 - n M + n c , n 2)3 , n c ) + max(n 2i4 - n 2 , 3 + n c , n 1A , n c ), (14) 

u' 2 = max(n 1)3 , n 2 , 3 ) + (max(n 2j4 , n 2j3 , n G ) - n 2 ,s) , (15) 

u 3 = max(n 2)4 , 711,4) + (max(ni, 3 , ni,4, n c ) - 711,4) , (16) 

u' A = max(ni, 3 , n c ) + max(n 2 _ 4 , n c ). (17) 

Hence, it is enough to show that the minimum of the four terms above and 
u' 5 = log ( 1 + (|/ii, 3 | 2 + \h 2A \ 2 + \h lA \ 2 + \h 2j3 

(18) 



+ (|^l,3^2,4| 2 + \hl A h 2 ^\ 2 - 2|/ii, 3 /i 2 ,4/ii,4/i 2j3 | COS 6») j , 



which is within a constant (3 bits) of U5, is achievable. We again consider the same four 
regimes as in the proof of Theorem [TJ 

Regime (i): nc < n m [ n = min(ni, 3 , ni, 4 , n 2j3 , n 2) 4). The discussion for regime (i) in the linear 
deterministic case continues to hold here as well. Note that U5 is such that 

_ 5 < » def f max(rii i3 + n 2 , 4 , n x , 4 + n 2 , 3 ), if ni, 3 - n 2j3 ^ n 1A - 7i 2 , 4 , 
— 5 1 max(ni i3 , n 2j4 , ni j4 , n 2j3 ), otherwise. 



Thus, when condition (13) does not hold, the achievability (within a gap of 7 bits from the 



upperbound) is implied by the results of Etkin-Tse-Wang [2] (where 2-bit gap comes from [2] 



and an additional 5 bits were incurred above). And, when condition (13) holds, we need 
only show achievability in the restricted regime of nc where 

1*1(710) < mm(u 2 (nc),u' 3 (nc),u 4 (n c ),ul). 

We employ Theorem[4](a) using the following auxiliary random variables W, Vi, U±, Zi, V 2 , U 2 , Z 2 
are zero-mean Gaussian random variables and independent of each other with the following 
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variances: 



< = < = UK, 

1/K 



a u 1 = a u 2 



max(l, \h c \Y 



2 l l K A 

a Zi = 7TJT. — 12V and 

1 max(l, |/ii,4| 2 ) 



a 



1/K 



Z2 max(l,|/ i2i3 | 2 )' 

where if is a constant which will be specified soon. W is independent of all these and has 
the same distribution as (Vi,V 2 ). X\ and X 2 are defined as 

X 1 = V 1 + U 1 + Z 1 , 

x 2 = v 2 + u 2 + z 2 . 

In order for the power constraint to be satisfied, it is enough to have K < 3. This defines 
VwVv 1 ,u 1 ,x 1 \wVv2,U2,x 2 \w ■ These choices are such that the "private" signal Z\ appears at 
destination 4 with less power than the noise, and, similarly, Z 2 appears at destination 3 
with less power than the noise. With these choices, the conditions on the non-negative rates 
rvi , r V2 , r Vl , r Ua , r Zl , r Z2 are 

, ( \hc\ 2 /K\ 
r Vl < log 1 + 1 



rz 



2/K + lJ 
( |^ 3 | 2 /(max(l,|^ 4 | 2 )if) \ 

1 - l0 H Y/KTi ) ' 

/i li3 | 2 /(max(l, \h c \ 2 )K) + |/i li3 |7(max(l, \h lA \ 2 )K)' 



ru! + r Zl < log ^1 + 
r U2 + r z L < log 1 + 



1/K + l 

|/i li3 | 2 /(max(l, \h lA \ 2 )K) + |fr 2 , 3 | 2 /(max(l, \h c \ 2 )K) ~ 

1/K + l 

\hi, :i \ 2 , |fei,3| 2 , \h2, 3 \ 2 



+ 



r- M r A.r- <T lncr I 1 _l_ """(Ll^cl 2 )* T majc (l,|ft 1|4 |2)g T max(l,|fe c | 2 )^ 

+ r i/ 2 + S log | 1 H 1/K + 1 

|fel,3| 2 , |fel,3| 2 , |fel,3| 2 , |fc 2 , 3 | 2 , |fc2,3| 2 \ 

t- + r + r + r + r < lnr I 1 I * max(l,|fe c P)X + EMx^hi^Jfjjf + g max(l,|M 2 )^ 
Mi + Mj i ' ; " : 'cr ; zj S u '8 | 1 H lT^KL 

Simplifying, we can show that these conditions imply that non-negative rates which satisfy 
the same conditions as in the linear deterministic case (up to a constant) are achievable. 

?Vi <n c - log 5 

r Zl < K,3 -«i,4]+ - log 4, 
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Tux + r Zl < ni )3 -n c - log 4, 
rc/ 2 + r Zl < max(ni i3 - n M , n 2<3 - n G ) - log 4, 
fv x + r U2 + r Zl < max(n li3 - n c , n 2 - 3 - n c ) - log 4, 
rvi + r v 2 + r U! + r U2 + r Zl < max(ni )3 , n 2j3 ) - log 4, 

and the corresponding inequalities with subscripts 1 and 2 exchanged, and 3 and 4 exchanged. 
Note that the right hand sides above should be interpreted as zero if they evaluate to less 
than zero. We will tacitly assume this for similar conditions in the sequel. Further, we make 
the following choices for the rates. 

r Zl = [n 1)3 - n 1A } + - log 4, 
r Z2 = [n 2j4 - ^2, 3 ]+ - log 4, 
rvi = r v 2 =n c - log 5, 

r Vl = max(n 2i4 - n 2 , 3 , ^i,4 - n c ) - [n 2 A ~ ^2, 3 ]+ - log 4, and 
r U2 = max(ni j3 - n M , n 2 , 3 - nc) - K, 3 - n 1A ] + - log 4, 

where we interpret the rates as zero if their values work out to less than zero. It can be 
shown that under the restricted regime of nc, these choices satisfy all the conditions above. 
The resulting sum-rate is u'^nc) within a constant gap (of at most 13 bits) as required. 



When condition (13) does not hold, as we mentioned earlier, it is enough to prove that 
the nc = is achievable. We may invoke the achievability proof of Etkin-Tse- 

Wang [2] to conclude that a sum-rate which matches our upperbounds (up to a constant) is 
achievable. Thus, overall, in regime (i), we may conclude that the upperbound is achievable 
within a constant gap of 18 bits. 

Regime (%%): n m [ n < nc < min(ni i3 , n 2j 4). As in the linear deterministic case, the achievability 
in this regime is implied by the achievability in regime (i). 

Regime (Hi): min(ni )3 , n 2A ) < n < max(ni i3 , ra 2)4 ). Without loss of generality, let us assume 
that n 13 < nc < n 2l 4. We will apply Theorem fflc) to establish the achievability. We 
consider two separate possibilities: (1) \h 2A \ > \hi A \ and (2) \h 2A \ < \hi A \. 
(1) When \h 2A \ > \hi A \ (which implies that n 2A > n li4 ) , the auxiliary random variables 
are as follows: U\ is set to a constant. U 2 , Vi, V 2 , Z\, Z 2 , Sij, S 23 and S[ are independent 
zero-mean Gaussian random variables. Their variances are as follows 



^ 2 ^2 

°Vi = a v 2 


= 1/K, 


< 


l/K 
max(l, \h' c \ 2 ) ' 




l/K 
max(l, \hi t 4 2 ) 


< 


l/K 
max(l, \h 2 ,3 2 ) 


4, 


= 1/K, 
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1JK 

5 i max(l,|/ ll) 3| 2 ,|/ii l4 | 2 )' 
5 m max(l, |/i 2)4 | 2 )' 



where .K" and h' c (0 < /i' c ) will be specified. Let us define n' c = f [log/i /2 7 ] + . We will pick a 
such that n' c < min(nc, ^2,3)- We set W to be independent of all these and have the same 
distribution as (Vi, V 2 ). We define X ± , and X 2 as follows 

x 1 = v 1 + z 1 + s lt3 + s[, 

X 2 = V 2 + U 2 + Z 2 — - — rr~~ — 1 5*1,3 + S-23- 



This satisfies the power constraint if K < 5 (where we used the fact that \h 2 ^\ > |^i,4|)- Let 
Si = (Si,3, ^3) and 



Then 



n = |^i, 3 |(vi + Zi + 50 + ^ + ^3, 

n = IM(^ 2 + c/ 2 + 30 + IMe^Vi + ^ + so + \h 2A \S^ + iV 4 . 
The conditions on the rates are 

r Sl < log (l + |/i c |7(max(l, |/i 1;3 | 2 , |/i M | 2 )iO) , 





2 


max(l, \h lj3 \ 


2 , 1^1,4 


2 )if 



I /I 



c 



12 



M 2 , \h c ? \h 



\h c 


2 


max(l, |/i 1)3 | 




2 )if 



ry, +r Zl +r Sl < log 1 + — + — — + 



?V 2 < log 1 



■c\ 



12 





2 /K 




2 /(max(l, \h> c \*)K) + \h c 


2 /(max(l, |/i 2>3 


2 )K) + 1 



|/i 1>3 |7(max(l, \h lA \ 2 )K) + |/i 2 , 3 |7(max(l, \h' c \ 2 )K)' 



r U2 + r Zl < log ( 1 + 
rsi + »"Zi < log 



2/K + l 

( |fel, 3 | 2 . ||fel,3l|fe2,4|-|fel,4||fe2,3|e je | 2 |fe 2 , 3 | 2 \ 

1 max(l,|fc M | 2 )if + \h 2A \ 2 K + max(l,|fo 2 , 4 | 2 )K 

2/K+l 

V / 



32 



ru 2 + + r Zl 



( 



<log 



1 + 



|fel, 3 | 2 , ||fel,3||fe2,4|-|fel,4||fe2,3|e^| 

2 ~r i j, _ . 1 2 v t 



max(l,|/i M | 2 )X 



|ft2,4| 2 /!T 



|fe2,3| 2 , |fe2,3| 2 \ 

max(l,|/i 2 ,4| 2 )-^ ~ T " max(l,|/i^| 2 )X 



V 



(rvj + ry 2 ) + r U2 + r 5l + r Zl 



<log 



/ l^i.sl 2 



1 + 



+ 



2/K + l 

||fel,3l|fe2,4|-|fel,4||fe2, 3 |e^| 2 Jft^al 



/ 



K 1 max(l,|/ii i4 | 2 )K 



|fe,4| 2 ^ 



, |fe2, 3 | 2 \ 

K 1 max(l,|/i 2 , 4 | 2 )^ majt(l,\h' c \ 2 )K 



2/K + l 



r U2 + r z 2 < 



r^<]og(l + ^W^ ), 

/ |fe 2 , 4 |V(max(l, |/^| 2 )if ) + |fr 2 , 4 | 2 /(max(l, |/^ 2 , 3 | 2 )^ \ 



(rvx + rv 2 ) + r U2 + r Z2 < log 1 + 



|fe 2 ,4| 2 _ _ 

K t max(l, \h'\ 2 )K ^ max(l,|/i 2 ,3l 2 )^ ^ K 



, |fe2,4| 2 , |fe 2 ,4| 2 , |fel,4| 2 \ 

r ij,' I2^ at- ~t „,„„n i»,„„i2Mi' ~r ^ 



3/K + l 



) 



Upon simplification, the above conditions imply that non-negative rates which satisfy the 
conditions below are achievable. 

r Sl < [nc ~ max(ni )3 ,ni )4 )] + - log 5, 
rvi + r Zl + r Sl <n c - log 5, 

r V2 < n' c - log 7, 

r Zl < [n h3 - n 1A ] + - log 7, 
ru 2 + r Zl < max([ni >3 - n 1A ] + ,n 2;3 - n' c ) - log 7, 



rsi + r Zl 
< log ( 1 + 



hi 



,3 



max(l, |/i M |) 
^c/ 2 + r Sl + r Zl 



+ 



|^l,3||^2,4| - |/il,4||/i2,3|e- 



h 



2,4 



+ 



h 



2,3 



max(l, \h 2 ,4) 



log 7, 



< log 1 + 



2,3 



max(l, \h' c \) 



to 


^1,3 


2 


+ 


+ 







IMIM ~ \hi A \\h 2>3 \e^ 
h 2 A 



+ 



h 2 , 3 
max(l, \h 2 ,4) 



log 7, 
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(r Vl + r v , 2 ) + r U2 + r Sl + r Zl < max(m i3 , 712,3) - log 7, 



r Zi < ["-2,4 - n 2>3 ) + - log 8, 
ru 2 + r z 2 < max([n 2 , 4 - 712,3]+, "2,4 - n' c ) - log 8, 
(r Vl + r V2 ) + r U2 + r Za < max(n 2>4 , n M ) - log 8. 

Note that the conditions on the rates are exactly as in the linear deterministic case up to 
a constant except for the constraints on + r Zl and Ty 2 + rs x + r Zl . By Fourier-Motzkin 
elimination, we may conclude that a sum-rate R\ + i? 2 within a constant (9 bits) of the 
smaller of u 2 (nc) , u' 3 (ric) , u' 4 (ric) , u' 5 and 

[n c - max(m )3 , n M )] + + max(n 2)4 , 711,4) + max([m )3 - ni i4 ]+, n 2j3 - n' c ) + n c + + [ ra 2,4 - ^2,3] 



is achievable. The last term can be shown to be not smaller than the minimum of u' 2 (nc) , u' 3 (nc) , u'^nc) 
and u' 5 , if n' c is chosen to be such that ui{n' c ) = min(w 2 (nc), u 3 (nc), w 4 (nc), u$) when 
Wi(0) < min(w 2 (0), w 3 (0), w 4 (0), w 5 ), and n' c = 0, otherwise. Note that from earlier discus- 
sion, we know that this choice of n' c must be less than or equal to nc and all n^, i G {1, 2}, 
j G {3,4}, and in particular n 2 3 . 

(2) When \h%^\ < \hi A \ (which implies that n 2j4 < rii j4 ), we apply Theorem^c) as in case (1) 
above with the same choices for the auxiliary random variables W, Vi, V 2 , Ui, £/ 2 , Z\, Z 2 , S 23 , S[. 
But, instead of 5i, 3 we now define an independent, zero-mean Gaussian random variable £ 2 ,3 

with variance a~ = 1/K. The conditional distributions of X\ and X 2 are defined through 

S2.3 



x 1 = v 1 + z l 



\h 



2,4 



;S 



2.3 



q/ 



X 2 = V 2 + U 2 + Z 2 + £2,3 + S 23 , 

which satisfies the power constraint if we set K < 4 (since |/i 2 , 4 | < |^i, 4 |)- 
{£2,3, S 23 ) 



We define 



Si 



and 

Si- 



h2^ /2 



\ h i,3\ 








1^1,4 





S2,3 + \h2,3\e j9/2 S, 



2,3- 



The joint distribution of the signals received at the destinations is given by 

Y 3 = |/ii >3 |(Vi + Z 1 + S[) + \h 2 ,3W\V 2 + U 2 + Z 2 ) + Si + N 3 , 

Y A = \h 2A \(V 2 + U 2 + Z 2 ) + \h XA \e? e (V x + Z 1 + S[) + |/i 2 , 4 |^ 3 + iV 4 . 

The simplified conditions on the rates are identical to those in case (1), except for the 
following two 



rsi + r Zl < log 
ru 2 + + r Zl < log 



('♦ 


hi,3 
hi A 


2 

+ 




/l 2 ,4 

h'c 


2 

+ 




log 6. 
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Applying Fourier-Motzkin elimination and choosing n' c as in case (1) completes the achiev- 
ability proof (to within 9 bits of the upperbound) 

Thus, we may conclude that the upperbound is achievable with a gap of at most 9 bits 
in regime (iii). Note that we may represent the two cases together as follows: 

x 1 = v 1 + z 1 + s li3 + S[, 

X 2 = V2 + U2 + Z2 + $2,4, 



where 



Si, 



Si, 



\ h 2,4\ Q 
rO' 

|fel,4|e^/ 2 



\hi A \ej«/ 



2 °2,3, 



if 1^-2,4! > \hi t 
otherwise. 



s- 



2,3 



\h2,4\ 
^2,3 + S23, 



Si,3 + <S£ 3 , if \h 2A \ > {h 



4 j 



otherwise. 



This gives 



Y 3 
Y 4 



|/ii, 3 |(Vi + Z 1 + S[) + \h 2 , 3 \e je ' 2 (V 2 + U 2 + Z 2 ) + S 1 + N, 



3- 



h 2A \(V 2 + U 2 + Z 2 ) + |/i2,3|e^ /2 (K! + Z x + 50 + |/i 2 ,4|^ 3 + A^4, 



where, we define S\ = (S 2t3 , Si >3 , S 23 ) and 

5i = \h 2 , 3 \e^ 2 S 2 , 3 + \h 1>3 \S x , 3 + \h 2 , 3 \e^ 2 S^ 3 . 

The distribution of S\, S 2 we will employ in regime (iv) below is a symmetric generalization 
of this. 

Regime (iv): max(ni j 3, 712,4) < nc- in this regime, we employ TheoremQb) as we did for the 
linear deterministic case. Ui, U 2 are constants. Vi, V 2 , Z\, Z 2 , S\$, S 2 ,Zi Si,4, S 2 ,a, S 23 , Si 4 , S[, S 2 
are independent zero-mean Gaussian random variables. Their variances are as follows 



a 



1 2 



1/K, 



1/K 



'z 2 



max(l, |/ii,4| 2 ) 

1/K 
max(l, \h 2j3 \ 2 ) 



a 



°1A 



a 



1/K, ie{l,2}, JG{3,4}, 

1/K 
max(l, \hi y3 
1/K 



2V 



2,3 
2 



max(l, I /i2,4 1 2 ) 
1/K 



2 

°2 



max(l, |/ii, 3 | 2 , |^i,4| 2 )' 
1/K 

max(l, \h 2A \ 2 , |^ 2 ,3| 2 )' 
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where K is to be specified. We set W to be independent of all these and have the same 
distribution as (Vi, V 2 ). We define X ± , and X 2 as follows 

x 1 = V x + Z x + S h3 + S 1A + S[, 
X 2 — V 2 + Z 2 + S 2A + 5*1,3 + 5 2 , 



where 



, Si,3, if \h 2 ,4 > \hi,4, 

1 -ji^0mS 2 , 3 , otherwise. 

^ 3 = J ~ l>ll \h 2 e 4 \ Sh3 + S 2,3i lf I ^2,4 1 > | ^1,4 1, 

5*2,3 + 5^, otherwise. 

5*2,4, if \hi :3 \ > \h 2 , 3 \, 

2-1 " ] ~^0mS 1A , otherwise. 



S 



s 14 = I r'^Kai s ^ + s i^ if 1^1,3 1 > l ^2,3 1, 

I 5i 5 4 + S*^ 4 , otherwise. 

To satisfy the power constraint, it is enough to choose K < 7. Also, we define S*i = 

(^2,3, 5i )3 , 5 23 ), 5 2 = (5*1,4, 5*2,4, 5 14 ), and 

51 = \h 2 , 3 \e^ 2 S 2 , 3 + |/ii, 3 |5i, 3 + l^sl^ 2 ^, 

5 2 = |/ii,4|e^ /2 5i, 4 + |/i 2 ,4|5 2 ,4 + \h 1A \e je ' 2 St A . 

Thus, Si and S 2 are independent of each other. Note that we defined 5*i i3 and 5* 2i 3 in the 
same way as we did in regime (iii). The destinations receive 

Y 3 = |/n i3 |(Vi + Z 1 + S[) + |/i 2 ,3|e J ' e/2 (^2 + Z 2 + 5 2 ) + 5i + | ^1,3 j + ^3, 
Y 4 = \h 2A \(V 2 + Z 2 + S' 2 ) + |/ii,4|e^ /2 (yi + Zi + S[) + S 2 + \h 2A \S^ 3 + N 4 . 

It must be noted that |/ii,3|SV~4 an d l^2, 4 |5^" 3 have variances at most unity (which is the 
variance of the noise). Since U\ and U 2 are constants, we must set = ru 2 = 0. The 
conditions on the non-negative rates are as follows. 

r Sl < log (l + |/i c |7(max(l, \h li3 \\ \h 1A \ 2 )K)) , 
r Zl + r Sl < log (l + |/i c | 2 /(max(l, \h 1A \ 2 )K) + |/i c | 2 /(max(l, |/n, 3 | 2 , |/> M | 2 )^)) 

TVi + rzi + r Sl < log 1 + — — + — — — r— + 



\hc 
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max(l, \h h3 \ 


2 , I^mI 


2 )x 
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rsi +r Zl < ' 



log 



log 



1 + 



1 + 



1^1,3 1 



1^1,41 



1^1,3 1 1^2,4 1 — l^l,4l 1^2,3 1 



|h2,4l 



\h 2 ,3\ 



''2,4 



\/K 



\ 



|ftl,3l 
1^1,41 



Ifcl,3l|fc2,4l-Ifcl,4l|fc2,3l eig 
\ h l,4\ 



l fc 2,3 1 
h 2,4l 



]/K 



4/K+l 



, if |/i2,4| > \hi,4 



, otherwise 



( 



< { 



log 



log 



Ksl 2 



1 + 



1*1,31 



1*1,4 1 



*l,3ll*2,4|-|*l,4ll*2, 3 l e ' 



1^2,4 1 



1^2,3 



1^2.4 



/A" 



\ 



M 2 



1 + 



1^1,3 I 
1^1,4 1 



+ l^2, 3 | 2 



4,/K+l 



Ihl, 3 l|h2,4l-I''l,4l|h2,3l e: ' < ' 
l*l,4l 



1*2,3 
1*2.4 



/AT 



4/A+l 



V 



, if \h 2) A\ > \h 1A \ 



, otherwise 



/ 



and the corresponding conditions with 1 and 2 interchanged and 3 and 4 interchanged. On 
simplifictaion, this implies that non-negative rates which satisfy the following conditions are 
achievable. 

rs! < [nc ~ max(m >3 ,ni i 4)]+ - log 7, 
rv! + r z 1 + r Sl <n c - log 7, 

r Zl < [ni,3 - ni >4 ]+ - log 11, 



r Sl + r Zl < 



log (1 + 
log (1 + 



|hi,s| 



1^1,41 

\hi, 3 \ 



\hi A \ 



+ 



+ 



|fel, 3 ||fe2,4|-|fel,4||fe2, 3 |e^ 
\h2 A \ 

|fel, 3 ||fe2,4|-|fel,4||fe2, 3 |e^ 



1^1,41 



+ 



+ 



I ^2,3 1 



1^2,41 
1^2,3 1 



I ^2,4 1 



- log 11, if \h 2A \ > \h 1A \ 

— log 11, otherwise, 



(r Vl + r y J + r Sl + r Zl < max(ni j3 , n 2 , 3 ) - log 11, 

and the corresponding conditions with 1 and 2 interchanged and 3 and 4 interchanged. Note 
that these conditions are identical to the ones for the linear deterministic case (up to a 
constant) except for the ones on rs 1 +r Zl and r$ 2 +r Z2 . Fourier-Motzkin elimination shows 
that a sum-rate given by the minimum of u' 2 (nc) , u' 3 (nc) , u' 4 (nc) , and u' 5 is achievable in 
regime (iv) up to a constant of 7 bits. 

Thus, we can conclude that the upperbound is achievable within a constant gap (of 18 
bits). 
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D Source cooperation: upperbounds 

We prove four upperbounds to the sum-rate which will together imply the upperbounds in 
Theorems Q] and El 

Upperbound 1: From Fano's inequality, 

T{Rt + R 2 - o(e)) < J(M i; Yf) + I(M 2 ; Y 4 T ) 

< J(M i; Y 3 T , h\ A (XT), h* 1:2 (Xf), K^Xl)) 

+ J(M 2 ; Y 4 T , hl^XZ), hl 2 (X-[), h^X?)). 

Note that we have provided additional signals to both the destinations - for instance, des- 
tination 4 now has access to hl A (X-[), h\ 2 {Xj), and ^i^)) m addition to its channel 
output Kj . We will now upperbound the two symmetric mutual information terms above; 
only the first term is shown below. 

/(M i; y 3 T , h* 1A (x?), hi ;2 (x^ ^(xT)) 

= H(h{ A (Xl), hi 2 {X? ), ^,i(* 2 T )) + H(Y 3 T \h* 1A (X?), hl 2 {X?), K^X*)) 

- H(Y 3 T , hl A (XD, h^(X?), ^(X^IMO 

< H(hl A (Xf), hl 2 (X?), h* 2tl (Xi)) + H{Yl\h\ A {Xl), h^X?)) 

- H(Y 3 T , h* lA (Xf), h* li2 (X?), h^X^Mj. 

We now derive an upperbound for the third term. 
H(Y 3 T , h\ A {Xl), h^X?), h^X^lM,) 

= ^ H(Y 3 (t), h^iXtit)), K^X^t)), h* 2<1 (X 2 (t))\Yt\ K^Xt 1 ), K^Xt 1 ), h^Xt 1 ), M x 



t=i 

T 

(a) - 



Y h (hi 3 (x 2 (t)), hUix^t)), K^x.it)), h* 2il (x 2 (t)) 



t=i 



x i^* 2)3 (X 2 l ),h* lA {X\ 1 ), 

hl 2 {X\- l ),hl 1 {X i 2 - 1 ) ) M 1 

t-i 

+ H(h* 2 . 3 (X 2 (t)), h* tl (X 2 (t))\Xl h^Xt 1 ), K A {X\- 1 ), hl^Xt 1 ), h^Xt 1 ), Mi) 

= E^(^(^i(*))l^i(*)) +H(hl 2 (X 1 (t))\Xi(t)) 
f=i 

+ H(hi 3 (x 2 (t)), h^ix.mh^ixt 1 ), K i2 {x\- 1 ), hi^xt 1 )) 

^YHihi^x.mxiW+Hihi^mMt)) 

t=l 

+ H(h* 2j3 (X 2 (t)), ht^X^t)), h^X.mhUXt 1 ), hUXt 1 ), hl^Xt 1 )) 
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> J2H(hl A (Xi(t))\Xi(t)) + H(hl 2 (X 1 (t))\X 1 (t)) 
t=i 

+ H(h* 2 , 3 (X 2 (t)), h* h2 (Xi(t)), h^X.mh^Xt 1 ), ht^Xt 1 ), h^Xt 1 )) 
- Hihl^mhUXt 1 )) 
= H(h^(XZ), hl 2 (X?), h* 2A (XT)) - H(hl 2 (X?)) 

+ (jimuMmMt)) +H(hi 2 (x 1 (t))\Mt)) ] j 

where (a) follows from the following facts: (1) Y 3 (s) = hi^(Xi(s)) + hl 3 (X 2 (s)) , (2) h^ 3 is 
a deterministic function, and (3) Xi(s) is a deterministic function of Mi, /^(X^ -1 ), for all 
s < t. Equality (b) can be seen from the channel model by which, conditioned on Xi(t), the 
following three sets of random variables are independent: (1) h\ 4 (Xi(t)), (2) h* l2 (Xi(t)), 
and (3) Mi and all other quantities with indices up to and including t. The next equality 
(c) is a consequence of the fact that the following is a Markov chain 

(M 2 ,X<,h^(X*)) - (hl^Xl- 1 ),^^- 1 )) - (Mi, X{). 

This follows from (1) the channel Phi^x^^x^^x^x^ = P^ j2 (x 1 )|x 1 P/ l | 1 (x 2 )|x 1 P^ >3 (x 2 )|x 2 , 
(2) the independence of Mi and M 2 , and (3) the fact that the channel inputs depend deter- 
ministically on the messages at the respective sources and what these sources have received. 
Equality (d) is just the chain rule of entropy, and the inequality (e) follows from the non- 
negativity of mutual information. 
Combining everything, we have 

t(Ri + r 2 - o(e)) < ( {#(>f K, 4 prf), hl,(XD) - E H(h* 2t3 (X 2 (t))\X 2 (t))} 

+ ^(hl^Xf)) -^H(hl 2 (Xi(t))\Xi(t))^ 

+ ^H(y?\hZ fl (xZ),hi*(x?)) -E H(h*i A (Xi(t))\Xi(t))} 

+ ^(h^Xj)) -^H(h^(X 2 (t))\X 2 (t))^. 

Linear deterministic model: Evaluating the bound directly gives us 

Ri + R 2 < max(ni j3 - n M , n 2j3 - n c , 0) + n c + max(n 2i4 - n 2j3 , n M - n c , 0) + n C - 
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Gaussian model: Consider the first bracketed term. 

T 



1 

f 



< ^H(Y 3 T - h^h^h\ A {Xi) - h 2 ^h^{Xi ■)) - H(N 3 ) 



t=i 



T 



H(N 3 - h, 3 h^\N 4 - h^lN,) - H(N 3 ) 



< log 1 + 



hi, 3 



hi,4 



+ 



l 2,3 



h 2 A 



1 

f 



H(hZt(XT))-'£H(K fl (X 1 (t))\Xi(t)) 



t=i 



^H(h h2 X? + N%) - H(N 2 ) 



<log(l + |/i li2 | 2 ). 

If \hc\ < 1, we do not subract the term ^2,3^2,1^2,3(^2") "while upperbounding the first 
term. This gives 

^\mys T \hU x i)^h( x 2 T ))-l:mUMmx2(t))) 



1 



t=l 



< -H{Y> - h Xfl K£h\ A {Xl )) - H(N 3 ) 

= H(Ni - h^h^Nj + h 2 , 3 X%) - H(N 3 ) 
2 



< log 1 



hi,. 



hi. 



+ \h 



2,3 1 



And similarly, we do not subtract the term hi^h^hl^Xf) if \hi^\ < 1 which gives the 
following upperbound for the first term. 



1 

f 



H(Y 3 T \hl 4 (X^),hl 1 (X^))-Y,mhUX 2 mX2(t)) 



t=l 



< log 1 + \h h3 \ 2 + 



h 



2,3 



h 2 ,i 



If both |/ii j4 | < 1 and \hc\ < 1, we do not subtract either terms in which case we get the 
upperbound 



1 
T 



//(y 3 T |^ 4 (xn,^ 1 (x 2 T ))-E^2 ! 3(^2(0)l^ 2 W) 

<log(l + (|^3| + |^2,3|) 2 ). 
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Similarly, upperbounding the second bracketed term and combining, we have 

R X + R 2 < log | 1 + f + I VI V ] (1 + \ hc?) 

\max(l, |/ii )4 |) max(l, \h c \) J 




Upperbounds 2 and 3: We prove upperbound 2 below, the third one follows from a similar 
argument. From Fano's inequality, 

T(i2 1 + i2 2 -o(e)) 

</(M i; y 3 T ) + /(M 2; y 4 T ) 

< /(M i; y 3 T ) + /(M 2 ;F 4 T ,^ i3 (X 2 T ),/ i * li2 (X 1 T ),^, 1 (X 2 T ),M 1 ) 

< /(M i; F 3 T ) + /(M 2 ;F 4 T ,^ 3 (X 2 T ),/ i t i2 (X 1 T ),^ 2 , 1 (X 2 r )|M 1 ) 

+ J(M 2 ; Y 4 (t), h*, 3 (X 2 (t)), hl^X.it)), h* 2jl (X 2 (t))\Yt\ h*^ 1 ), K^ 1 ), h*^' 1 ), Mi). 
Below, we upperbound these terms separately. 

I{M l] Y^yt 1 ) = H{Y,myt 1 )-H{Y^t)\Yt\M l ) 

< H(Y 3 (t)) -H(Y 3 (t)\Yt\Yt\hl 2 (X{~ 1 ),hl 1 (Xt 1 ),M 1 ) 

< H(Y 3 (t)) - H(h^(X 2 (t))\yt\ hUXt 1 ), K^Xt 1 ), h^Xt 1 ), MO, 

where (a) follows from the fact that Y 3 (t) = hi :3 (Xi(t)) + h 23 (X 2 (t)), and, hi )3 is a deter- 
ministic function and Xi(t) is a deterministic function /i )t of (Mi, /i 2 1 (X 2 ~ 1 )). 

J(M 2 ;F 4 (t), ^, 3 (X 2 (t)), /^(X^)), /^(X^))^" 1 , ^(X*- 1 ), ^(X^ 1 ), h^Xt 1 ), Mi) 
= J(M 2 ; ^(^(t))!^- 1 , ^(X*- 1 ), ^(X*- 1 ), ^(X^ 1 ), Mi) 
+ J(M 2 ; nWI^ 1 , ^, 3 (X*), /^(X^), ^^(X*- 1 ), Mi) 
+ J(M 2 ; ^, 2 (Xi(t)), ^i(X 2 (t))|F 4 *, ^ 3 (X<), /^(X^), ^(X^ 1 ), Mi) 
We upperbound these three terms separately now. 

J(M 2 ; ^(X^))^'- 1 , h^Xt 1 ), hl^Xt 1 ), ^(Xf 1 ), Mi) 

= H(h* 2i3 (X 2 (t))\Yt\ hl 3 {X\-% hl 2 {X[-% h^Xf), Mi) 

- H(h* 2i3 (X 2 (t))\Yt\ ^(X*- 1 ), K^Xt 1 ), ^(X*- 1 ), Mi, M 2 ). 
J(M 2 ; Y A (t)\Yt\hZ fl (X*), hl^X*- 1 ), h^iX*- 1 ), Mi) 

= H(Y 4 (t)\Yt\ h^Xt), hl^Xt 1 ), hl^Xt 1 ), Mi) 

- (y 4 (f)|l?-\ ^ 2 , 3 (X*), ^(Xj- 1 ), /^(X'" 1 ), Mi, M 2 ) 

<i/(r 4 (t)|Xi(t),^ 3 (X 2 (t)))-/J(F 4 (t)|Xi(t),X 2 (t)), 
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where (a) follows from the channel model (memorylessness and independence of the noise 
processes at the different nodes) and the fact that Xi(t) (X 2 (t), resp.) is a deterministic 
function / M (/ 2>t , resp.) of (M u h^X*' 1 )) ((M 2 , h^Xt 1 )), resp.). 

J(M 2 ; hl^X^t)), h^X^tWl h^{Xl), hl^Xl- 1 ), hl^X^ 1 ), Mi) 

^ J(M 2 ; ^ ;1 (X 2 (i))|y 4 *, ^ 3 (X*), ^pC*" 1 ), ^^(X*- 1 ), Mi) 
= F(^i(X 2 (t))|F 4 ', ^ 3 (X*), hl^Xt 1 ), h^Xt 1 ), Mi) 
- //(^ ;1 (X 2 (0)|y 4 *, h* 2t3 (X*), K^Xl' 1 ), h^Xf 1 ), Mi, M 2 ) 

</j(^i(x 2 (t))|r 4 (t),^ 3 (x 2 (t)),Xi(t))-iJ(^i(x 2 (t))|x 2 (t)), 

where (a) follows from the channel model (memorylessness and the independence of the noise 
processes at the different nodes) and the fact that Xi(t) is a deterministic function fi jt of 
(Mi, /i 2j i(X 2 _1 )), and (b) follows from the facts above, its analogue for X 2 (t) and the channel 
model. 

Combining everything, we have 

T{R 1 + R 2 -o{e))<j:\H{Y,{t)\Yt 1 ) 

t=i I 

- H{hl,{X 2 {t))\Yt\ h^Xt 1 ), h^Xi' 1 ), h^Xt 1 ), Mi, M 2 ) 

+ { J f/(y 4 (t)|Xi(t),^ 3 (x 2 (t)))-iJ(y 4 (t)|Xi(t),x 2 (t))} 

+ {H(hl 1 (X 2 (t))\Y4(t),hl 3 (X 2 (t)),X 1 (t)) -H(hZ tl (X 2 (t))\Mt))} 

Linear deterministic model: Suppose n c < max(n 2i4 , n 2)3 ), then evaluating the upperbound, 

Ri + R 2 < {max(m i3 ,n 2i3 ) - 0} + {[n 2 , 4 - n 2j3 ] + - 0} + {0 - 0}. 

Otherwise, if nc > max(n 2)4 , n 2)3 ) we have either: (a) nc > n 2 ^ > n 2j3 in which case, we 
lower the noise level at node 4 by nc — n 2)4 level by defining n 2 4 = n 2j4 + (n c — n 2;4 ) and 
rc'i 4 — n i,4 + ( n c — n 2,i)- Now the upperbound evaluates to 

Ri + R 2 < max(ni >3 , ra 2 , 3 ) + n c - n 2j3 = n c + [n lfl - n 2>3 ] + , 

or (b) n c > n 2 3 > n 2i4 in which case, we lower the noise level at node 3 by nc — n 2j3 by 
defining n' 13 = ni ;3 + (nc — n 2j3 ) and n 2 3 = n 2j3 + (nc — ^2,3)- Now the upperbound becomes 

#1 + R2 < max(ni >3 , ra 2 , 3 ) + n c - n 2j3 = n c + [n 1>3 - ra 2>3 ] + - 

Hence, without any conditions on nc, n 2i4 , n 2;3 , we have 

i?i + i? 2 < max(n 2)4 , n 2j3 , n c ) + [m >3 - n 2)3 ]+- 
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By symmetry, we also have 

Ri + R 2 < max(ni )3 , n M , n c ) + [n 2 ,4 - "1,4]+- 
Gaussian model: When \hc\ < max(|/i 2i3 |, |^2,4|), 

1 £ [HiY^tWt 1 ) - H{hl,{X 2 {t))\Yt\ h^Xt 1 ), K^Xt 1 ), h^Xt 1 ), Mx, M 2 )} 

<^E^(n(0)-^s(0) 



T 

<log(l+ (|/ll )3 | + l^2,3|) 2 ). 



\2\ 



1 ^{^(oixxw, ^,3(^2(0)) -E{Ymm\m))} 
1 t=i 



1 t 



< - J2 H(Y A (t) - h 1A Xi(t) - h 2A h^ 3 hl 3 (X 2 (t))) - H(N 4 (t)) 



T 



\ ^ H(N 4 (t) - h 2A h^N 3 (t)) - H(N 4 (t)) 
1 t=i 



< log 1 + 



h 2A 



^E{ff(^(x 2 (0)|n(0,^(^2(0)^i(0)-^,i(^2(*))l^2(0)} 

1 t=l 



1 T 



< min [-^fr^Wt)) - V^XatM*))) - ^(iViW), 



t=l 

T 



* g^ fl (X 2 (t)) - ^2,1^(^(0 - ^1,4^1 (0)) - HiN^t)) 



1 T 



1 T 



r E tfW) - ^,1^4(0) - ^(M(O)J 

< log (l + min(|/i c | 2 /|/i 2)3 | 2 , \h c \ 2 /\h 2A \ 2 )) 

< 1. 

Also, (when |/i 2 3 | < 1) we may upperbound the second term without subtracting h 2A h 2 ^ z h* 2 Z [X 2 (t)) 

^E{^ (^(01^(0, ^,3(^(0)) -#W)|Xi(i),x 2 (i))} < io g (i + im 2 ) . 

Thus, we have 

Ri + R 2 < log2(l + (|/i 1)3 | + IM) 2 ) + log I 1 + 



2\ , , l^2,4| 



max(l, |/i 2 ,3| 2 )/ ' 
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If \hc\ > \h 2 ^\ > I ^2,3 1, w e consider the following "enhanced" channel with channel 
coefficients indicated by primed quantities defined by 



h' 

"-2,4 


\hc\ 


^2,4 


\ h 2,i\ 


K A _ 


\hc\ 







It is easy to see that this is equivalent to lowering the noise variance at destination 4 from 
unity to \h 2 ^\ 2 /\h c \ 2 ■ Hence, an upperbound on the "enhanced" channel is also an upper- 
bound for the original. Thus, we have 



\l |2 

Ri + R 2 < log2(l + (\h lt3 \ + \h 2 j\f) + log I 1 + 



1*2,41 



max(l, |/i 2 ,3| 2 ), 

/ h |2 

= log2(l + (1^,31 + |/i 2 ,3|) 2 )+ log 1 + 



max(l, \h 2t3 \ 2 ) J 

Similarly, if \h c \ > | /*2,3 1 > 1*2,4 1, we consider the following "enhanced" channel 



K,3 _ 


\hc\ 


h,3 


1*2,3 1 


h' 

u 2,3 


\hc\ 


h 2 ,3 


1*2,3 1 



which is equivalent to lowering the noise variance at destination 3 from unity to \h 2>3 \ 2 /\hc\ 2 ■ 
Then, for \h 2 , 3 \ > 1, 



/ l\2\ ,i / 1 l*2,4| 



Ri + R2 < log2(l + (\h' h3 \ + \h' 2fl \) 2 ) + log 1 + 



max(l, \h' 2i3 
<log2(l + (|/n >3 | + \h 2 , 3 \) 2 ) + log [l+ ll '-' r 



* 



2,3| 



For \h 2>3 \ < 1, we upperbound the three terms directly 

\ E {HpsWlYt 1 ) ~ H(hl 3 (X 2 (t))\Yt\ h^iXt 1 ), hl 2 {X[-\ h^X*' 1 ), M u M 2 )} 



t=i 

T 



<^Hm))-H(N 3 (t)) 
1 t=i 



<log(l + (\h lfl \ + \h 2 , 3 \) 2 ). 

^E{#(n(f)l*i(*), f^(X 2 (t))) -HiY^lX^tlX^t))} 



t=i 
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1 *=1 

= I E + h2 A X 2 {t)) - H(N 4 (t)) 

1 t=i 

<log (l + \h 2A \ 2 ) 
< log 2. 



T 

1 t=i 



1 T 



< f J2H(hh(Mt)))-H(Ni(t)) 



t=\ 



< 



log(l + \hc\ 



max( 




2 


ft 


2,3 


2 

) 


1^2,4 


2 ) 


max(l, \h 2 ,3 : 


2 ) 



Thus, in general, we have 

Ri + R 2 < log2(l + (|/n, 3 | + IM) 2 ) + log + 
Upperbound J^: 

This is a simple cut-set upperbound [20] with nodes 1 and 4 on one side of the cut and 
nodes 2 and 3 on the other. It is easy to verify that 

R 1 <m a xI(X 1 ;Y 3 ,Y 2 ), 

PX 1 

R 2 < max/(X 2 ;r 4 ,^i). 

Under the linear deterministic model, this translates to an upperbound on the sum-rate of 

Ri + R 2 < max(ni )3 , n c ) + max(n 2i4 , n c ), 
and for the Gaussian case, we get an upperbound of 

R! + R 2 < log (l + |/i li3 | 2 + \h c \ 2 ) + log (l + \h 2A \ 2 + \h c \ 2 ) . 

Upperbound 5: 

This is also a simple cut-set upperbound. Nodes 1 and 2 are on one side of the cut and 
nodes 3 and 4 are on the other. The resulting upperbound on the sum-rate is 

Ri + R 2 < max I{X X] X 2 ; Y 3 , Y 4 ). 

Px 1 ,x 2 



For the linear deterministic case, this gives 
Ri + R 2 < 



max(ni j3 + n 2 , 4 , n 1A + n 2 , 3 ), if n lj3 - n 2j3 ^ ni )4 - n 2 , 4 , 
max(ni 3 , n 2 4 , n\ 4 , n 2 3 ), otherwise, 
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and for the Gaussian case, using the fact the eigenvalues of the input ([Xi,X 2 ]) covariance 
matrix cannot exceed 2, we may upperbound the sum-rate by 

Rx + R 2 < log ^1 + 2 (|/i li3 | 2 + \h 2A \ 2 + \h lA \ 2 + |/i 2 , 3 | 2 ) 

+ 4 (\hi )3 h 2A \ 2 + | ^-1,4^2,3 1 2 - 21/11,3/12,4/11,4/12,3! cos 9 

Upperbounds 1, 2, and 3 can be further tightened to obtain a smaller constant gap in 
Theorem |5] by (1) considering the correlation between the noise processes at the destinations, 
as well as (2) modifying the correlation of the Gaussian noise processes in the additional 
signals we provide to the destinations. Also, the correlation between the input signals can be 
explicitly accounted for instead of assuming the worst-case correlation at different stages as 
we do here. Upperbound 5 can be easily improved by choosing the optimal input covariance 
matrix. 



E Upperbounds for the Gaussian interference channel 
with feedback 



Below, we will show that (12) is an upperbound on the sum-rate. To see that this is the only 



biting upperbound in this regime, let us consider the achievability proof. The achievability 
proof in appendix [C] only depends on the marginals of the noises and hence holds without 
change for the feedback case. Hence, the sum-rate achieved in appendix O is achievable for 



the feedback problem as well. Moreover, for the symmetric case, u' A of (17) in the achievability 



proof (appendix |C|) is strictly subsumed by w' 5 of (18). Also, since for noiseless feedback, ric 
and n min as defined in appendix |C| are equal, u[ of (14) is subsumed by min(u 2 , u' 3 , u' 4 , u' 5 ) as 



we argued in that appendix (where u 2 through u' 5 are defined in (15 )-( 18 ). For the symmetric 



channel u'n 



tin 



Evaluating u' 5 and u 2 for the symmetric channel with noiseless feedback 



reveals that u' 5 is never smaller than u 2 by more than 1 bit. Thus, the achievability proof 
is appendix [C] when applied to the symmetric channel with noiseless feedback achieves a 
sum-rate of u 2 within a gap of at most 13 bits. Also, u 2 is within a gap of at most 5 bits 



from (12). Thus, overall, the achievability proof is appendix [C] achieves (12) with a gap of 



at most 19 bits. 



It only remains to show that (12) is an upperbound to the sum-rate. The line of argument 



is similar to the one in the proof of upperbound 2 for the cooperation case; the main difference 
is that the the genie does not provide h* l2 {X : [) to destination 4. Let us define h 23 (X 2 ) = 
h 2 $X 2 + N\ as before. Starting from Fano's inequality, we write 



T{R X + R 2 - o(e)) < /(Mi; Y 3 T ) + J(M 2 ; F 4 5 
<I{M Vl Y 3 T ) + I{M 2 -Yl 



</(Mi;F 3 T ) + /(M 2 ;F 4 



1 • 



u 2,3 

h* 
u 2,3 



{xl 



,Mi) 
\M X ) 
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< f; /(m i; isWi^ 1 ) + nw, ^(x^t))!^- 1 , ^(x*- 1 ), mo. 
t=i 

Below, we upperbound these terms separately. 

< H{Yz{t)\Yt l ) - H(Y 3 (t)\Y.t\Yt\M l ) 

< HftWYt 1 ) ~ H(hl 3 (X 2 (t))\Yt 1 ,hl 3 (X t 2 ~ 1 ),M 1 ), 

where (a) follows from the fact that Y 3 (t) = hi 3 Xi(t) + h 23 (X 2 (t)), and Xi(t) is a deter- 
ministic function fx >t of (Mi). 

= J(M 2 ; /^(X^))^" 1 , ^(X*- 1 ), MO + I(M 2 ; Y 4 {t)\Yt\ ^{X*), M,). 
We upperbound the above two terms separately now. 
J(M 2 ; /^(X^))^ 1 , ^^(X*- 1 ), MO 

= H(hl 3 (X 2 (t))\Yt\ hl^Xt 1 ), MO - H{hl 3 {X 2 (t))\Yt\ K^Xt 1 ), M u M 2 ) 

( ^ ) if(/ i * 2i 3(x 2 (0)|r 4 i - 1 ,/ i ; i 3(xr l ),M0-i/(X3(0), 

I(M 2 ;Y 4 (t)\Yt\hl 3 (X t 2 ),M 1 ) 

= H{Y A (t)\Yt\ h* 2j3 (Xl), MO - H(Y 4 (t)\Yt\ ^(X*), M u M 2 ) 

< i?(y 4 (t)|x 1 (t),/ i * i3 (x 2 (t))) - #(x 4 (t)), 

where (a) and (b) follow from (1) the channel model (memorylessness and independence 
of the noise processes at the two destination nodes, and Y±(t) = h 2 ^X 2 {t) + h\ 4 (Xi{t)) = 
h 2 AX 2 (t) + hi^Xiit) + N i (t)), and (2) the fact that Xi(t) and X 2 (t), resp., are deterministic 
functions of (Mi, ^ 2j 3(X 2 -1 )) and (M 2 , /i^ 4 (Xj -1 )), resp. Combining everything, we have 

T ( Rl + R 2 - o(e)) < £ {HOhWlYt 1 ) ~ H(N 3 (t))} + h*, 3 (X 2 (t))) - H(N 4 (t))} . 

t=i 

The rest of the argument is exactly as in Appendix [Dj 



F A Gaussian example 

The upperbounds follow from Appendix [Dj We can show that the sum-rate is upperbounded 
by all of the following for the symmetric channel with hj = yho- 

[/i = 21og(l + 2V> + 
u 2 = u 3 = \og2[l+(h D + Jh D ))[l + - L> ( ] 



u 5 = log 1 + A(h z D + h D )+4(h z D -h D 



h D 

2^ 
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Note that U\ above is slightly stronger than the one on Theorem [2j but follows directly from 
the proof in appendix [d] when specialized to the symmetric channel with hi = ^/hp. Also, 
we have left out 114 since this upperbound is not important for this channel. The above 
upperbounds imply that for any e > 0, the sum-capacity is upperbounded by C + e, for 
sufficiently large ho- 

The achievability again depends on different schemes depending on the regime. For 
he < Vh~£), we apply Theorem |4^a). We choose Z 1; Z 2 , U\, U 2 , V\, V 2 as independent, zero- 
mean Gaussian auxiliary random variables with the following variances. 







1 








h D 




< 




1 


, and 


< 




= 1 - 


1 

h D 2 



h 2 c 

W is independent of all these and has the same distribution as {Vi,V 2 ). X± and X 2 are as 
follows. 

X 1 = V 1 + U 1 + Z 1 , 

x 2 = v 2 + u 2 + z 2 . 

Evaluating the expressions in Theorem |4)^a) and simplifying using Fourier- Motzkin elimina- 
tion, it can be shown that the upperbound is achievable in the regime he < \fh~D within a 
gap of 6 bits for sufficiently large h 2 D . 

In the range of \fh~o < he < ho, we find that, for sufficiently large ho, u 2 = 

log 2 ^1 + (ho + Vh~oj \ (1 + hp), which is independent of he in this regime, dominates the 

other bounds. Thus, the achievability in the regime he < yho implies achievability in this 
regime as well. 

For he > ho, we apply Theorem |4](b) with the following choices for the auxiliary random 
variables. S[, S' 2 , Zi, Z 2 , Si, S 2 ,Vi,V 2 are independent, zero- mean Gaussian auxiliary random 
variables with the following variances. 



2 

a S[ 


2 

— <J 01 — 
a 2 


1 

W 


2 


= 4 2 = 


1/2 

h D ' 




= °k = 


1 


2(l + 




= < = 


K 1 - 





and 
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W is independent of all these and has the same distribution as (14,1^), and U\,U 2 are 
constants. X\ and X 2 £1X6 cLS follows. 

X 1 = V 1 + S 1 - ^S 2 + Z X + S[, 
hx 

X 2 — V 2 + S 2 — 7 — Si -\- Z2 -\- S 2 . 

Y 3 = h D (V, + Z 1 + S[) + h x (V 2 + Z 2 + S' 2 ) + (h D + N 3 , 

Y 4 = h D (V 2 + Z 2 + S' 2 ) + h x (Vi + Z 1 + S[) + (h D -^js 2 + N 4 , 

Evaluating the expressions in Theorem |4](b) and simplifying using Fourier-Motzkin elimina- 
tion, it can be shown that the upperbound is achievable in the regime he > hp within a gap 
of 5 bits for sufficiently large ho. 
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